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<160> 16 

15 
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<210> 1 

20 <211> 1160 

<212> DNA 

<213> Rhodosporidium toruloides 



25 



<220> 
<221> 
<222> 
<223> 



CDS 

(1) . . (1104) 
coding for DAAO 



<400> 1 
30 atg cac teg cag 
Met His Ser Gin 
1 

ctg age age gec 
Leu Ser Ser Ala 
35 20 
etc gcg cgc gac 
Leu Ala Arg Asp 
35 

cca tgg get ggc 
40 Pro Trp Ala Gly 
50 

cct cga caa gca 
Pro Arg Gin Ala 
65 

45 ttg gtc ccg acg 
Leu Val Pro Thr 



aag cgc gtc gtt 
Lys Arg Val Val 
5 

etc ate etc get 
Leu lie Leu Ala 



ttg 
Leu 

gcg 
Ala 

aaa 
Lys 

ggc 
Gly 
85 



ccg 
Pro 

aat 
Asn 

tgg 

Trp 

70 

cat 

His 



gag gac 
Glu Asp 
40 

tgg acg 
Trp Thr 
55 

gaa gaa 
Glu Glu 

gee atg 
Ala Met 



gtc 
Val 

egg 

Arg 

25 

gtc 

Val 

cct 
Pro 

teg 
Ser 

tgg 
Trp 



etc gga 
Leu Gly 
10 

aag ggc 
Lys Gly 

teg age 
Ser Ser 

ttc atg 
Phe Met 

act ttc 
Thr Phe 
75 

etc aag 
Leu Lys 
90 



tea 
Ser 

tac 
Tyr 

cag 
Gin 

acg 

Thr 

60 

aag 

Lys 

ggg 

Gly 



ggc 
Gly 

age 
Ser 

act 

Thr 

45 

ctt 

Leu 



gtt ate 
Val He 
15 

gtg cat 
Val His 
30 

ttc get 
Phe Ala 

aca gac 
Thr Asp 



ggt 
Gly 

att 
He 

tea 
Ser 

ggt 
Gly 



aag tgg gtc gag 
Lys Trp Val Glu 
80 

acg agg egg ttc 
Thr Arg Arg Phe 
95 



48 



96 



144- 



192 



240 



288 
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gcg cag aac gaa gac ggc ttg etc ggg cac tgg tac aag gac ate acg 336 
Ala Gin Asn Glu Asp Gly Leu Leu Gly His Trp Tyr Lys Asp lie Thr 

100 105 110 

cca aat tac cgc ccc etc cca tct tec gaa tgt cca cct ggc get ate 38 4 

5 Pro Asn Tyr Arg Pro Leu Pro Ser Ser Glu Cys Pro Pro Gly Ala lie 
115 120 125 

ggc gta acc tac gac ace etc tec gtc cac gca cca aag tac tgc cag 432 
Gly Val Thr Tyr Asp Thr Leu Ser Val His Ala Pro Lys Tyr Cys Gin 
130 135 140 

10 tac ctt gca aga gag ctg cag aag etc ggc gcg acg ttt gag aga egg 480 
Tyr Leu Ala Arg Glu Leu Gin Lys Leu Gly Ala Thr Phe Glu Arg Arg 
145 150 155 160 

acc gtt acg teg ctt gag cag gcg ttc gac ggt gcg gat ttg gtg gtc 528 
Thr Val Thr Ser Leu Glu Gin Ala Phe Asp Gly Ala Asp Leu Val Val 
15 165 170 175 

aac get acg gga ctt ggc gee aag teg att gcg ggc ate gac gac caa 57 6 

Asn Ala Thr Gly Leu Gly Ala Lys Ser lie Ala Gly lie Asp Asp Gin 

180 185 190 

gec gee gag cca ate cgc -ggg caa acc gtc etc gtc aag tec cca tgc 624 
20 Ala Ala- Glu Pro lie Arg Gly Gin Thr Val Leu Val Lys Ser Pro Cys 
195 200 205 

aag cga tgc acg atg gac teg tec gac. ccc. get tct ccc gee tac ate 672 
Lys Arg Cys Thr Met Asp Ser Ser Asp Pro Ala Ser Pro Ala Tyr lie 
210 • 215 220 

25 att ccc cga cca ggt ggc gaa gtc ate tgc ggc ggg acg tac ggc gtg • 720 

lie Pro Arg Pro Gly Gly Glu Val lie Cys Gly Gly Thr Tyr Gly Val 
225 230 * 235 240 

gga gac tgg gac ttg tct gtc aac cca gag acg gtc cag egg ate etc 7^8 
Gly Asp Trp Asp Leu Ser Val Asn Pro Glu Thr Val Gin Arg lie Leu 
30 245 250 ... 255 

aag cac tgc ttg cgc etc gac ccg acc ate teg age gac gga acg ate 816 
Lys His Cys Leu Arg Leu Asp Pro Thr lie Ser Ser Asp Gly Thr lie 

260 265 270 

gaa ggc ate gag gtc etc cgc cac aac gtc ggc ttg cga cct gca cga , 8 64 

35 Glu Gly lie Glu Val Leu Arg His Asn Val Gly Leu Arg Pro Ala Arg 
275 280 285 

cga ggc gga ccc cgc gtt gag gca gaa egg ate gtc ctg cct etc gac 912 
Arg Gly Gly Pro Arg Val Glu Ala Glu Arg lie Val Leu Pro Leu Asp 
290 295 300 

40 egg aca aag teg ccc etc teg etc ggc agg ggc age gca cga gcg gcg 960 
Arg Thr Lys Ser Pro Leu Ser Leu Gly Arg Gly Ser Ala Arg Ala Ala 
305 310 315 320 

aag gag aag gag gtc acg ctt gtg cat gcg tat ggc ttc teg agt gcg 1008 
Lys Glu Lys Glu 'Val Thr Leu Val His Ala Tyr Gly Phe Ser Ser Ala 
45 325 330 335 

gga tac cag cag agt tgg ggc gcg gcg gag gat gtc gcg cag etc gtc 1056 
Gly Tyr Gin Gin Ser Trp Gly Ala Ala Glu Asp Val Ala Gin Leu Val 
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340 345 350 

gac gag gcg ttc cag egg tac cac ggc gcg gcg egg gag teg aag ttg 1104 

Asp Glu Ala Phe Gin Arg Tyr His Gly Ala Ala Arg Glu Ser Lys Leu 

355 • 360 365 

5 tagggeggga tttgtggctg tattgeggge atctacaaga aaaaaaaaaa aaaaaa 1160 

<210> 2 

<211> 368 

<212> PRT 

10 <213>' Rhodosporidium toruloides 



<400> 2 

Met His Ser Gin Lys Arg Val Val Val Leu Gly Ser Gly Val lie Gly 
15 10 15 

15 

Leu Ser Ser Ala Leu lie Leu Ala Arg Lys Gly Tyr Ser Val His lie 
20 25 30 

Leu Ala Arg Asp Leu Pro Glu Asp Val Ser Ser Gin Thr Phe Ala Ser 
20 35 40 45 

Pro Trp Ala Gly Ala Asn Trp Thr Pro Phe Met Thr Leu Thr Asp Gly 
50 55 60 

25 Pro Arg Gin Ala Lys Trp Glu Glu Ser Thr Phe Lys Lys Trp Val Glu. 
65 70 75 80 

Leu Val Pro Thr Gly His Ala Met Trp Leu Lys Gly Thr Arg Arg Phe 
85 90 95 

30 

Ala Gin Asn Glu Asp Gly Leu Leu Gly His Trp Tyr Lys Asp lie Thr 
10-0 105 110 

Pro Asn Tyr Arg Pro Leu Pro Ser' Ser Glu Cys Pro Pro Gly Ala lie 
35 115 120 125 



Gly Val Thr Tyr Asp 
130 

40 Tyr Leu Ala Arg Glu 
145 

Thr Val Thr Ser Leu 
16'5 

45 

Asn Ala Thr Gly Leu 
180 



Thr Leu Ser Val His Ala 
135 

Leu Gin Lys Leu Gly Ala 
150 155 

Glu Gin Ala Phe Asp Gly 
170 

Gly Ala Lys Ser lie Ala 
185 



Pro Lys Tyr Cys Gin 
140 

Thr Phe Glu Arg Arg 
160 

Ala Asp Leu Val Val 
175* 

Gly lie Asp Asp Gin 
190 
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Ala Ala Glu Pro He Arg Gly Gin 
195 200 

5 Lys Arg Cys Thr Met Asp Ser Ser 
210 215 

He Pro Arg Pro Gly Gly Glu Val 
225 230 

10 

Gly Asp Trp Asp Leu Ser Val Asn 
245 



Thr Val Leu Val Lys Ser Pro Cys 
205 

Asp Pro Ala Ser Pro Ala Tyr He 
220 

He Cys Gly Gly Thr Tyr Gly Val 
235 240 

Pro Glu Thr Val Gin Arg He Leu 
250 255 



Lys His Cys 

15 

Glu Gly He 
275 

20 Arg Gly Gly 

290 

Arg Thr Lys 
305 

25 

Lys Glu Lys 
Gly Tyr Gin 

30 

Asp Glu Ala 
355 



Leu Arg Leu Asp 
260 

Glu Val Leu Arg 



Pro Arg Val Glu 
295 

Ser Pro Leu Ser 
310 

Glu Val Thr Leu 
325 

Gin Ser Trp Gly 
340 

Phe Gin Arg Tyr 



Pro Thr He Ser Ser 
265 

His Asn Val Gly Leu 
280 

Ala Glu Arg He Val 
300 

Leu Gly Arg Gly Ser 
315 

Val His Ala Tyr Gly 
330 

Ala Ala Glu Asp Val 
345 

His Gly Ala Ala Arg 
360 



Asp Gly Thr He 
270 

Arg Pro Ala Arg 
285 

Leu Pro Leu Asp 



Ala Arg Ala Ala 
320 

Phe Ser Ser Ala 
335 

Ala Gin Leu Val 
350 

Glu Ser Lys Leu 
365 



35 <210> 3 

<211> 1005 

<212> DNA 

<213> Caenorhabditis elegans 



40 <220> 

<221> CDS 

<222> (1)..(1002) 

<223> coding for DAAO 



45 



<400> 3 

atg gca aac ata att ccg aag att gca att ate ggc gaa gga gtc att 
Met Ala Asn He He Pro Lys He Ala He He Gly Glu Gly Val He 



48 



144 



192 
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15 10 15 

gga tgt act tea gca ctt caa at a tea aaa get ata cca aat gcg aaa 96 
Gly Cys Thr Ser Ala Leu Gin lie Ser Lys Ala lie Pro Asn Ala Lys 
20 25 30 

5 ata act gtg etc cac gat aaa cca ttt aaa aaa teg tgc agt gca gga 
He Thr Val Lea His Asp Lys Pro Phe Lys Lys Ser Cys Ser Ala Gly 

35 40 45 

cca gca gga tta ttt aga ate gat tat gag gag aat act gaa tac gga 
Pro Ala Gly Leu. Phe Arg He Asp Tyr Glu Glu Asn Thr Glu Tyr Gly 
10 50 55 60 

cgt get tct ttc gec tgg ttc tea cat etc tat cgc act aca aaa gga 240 
Arg Ala Ser Phe Ala Trp Phe Ser His Leu Tyr Arg Thr Thr Lys Gly 
65 70 ' 75 80 

tec gaa ace ggc gtg aaa tta gtt tct gga cat att caa tec gac aac 288 
15 Ser Glu Thr Gly Val Lys Leu Val Ser Gly His He Gin Ser Asp Asn 

85 90 95 

ttg gag tea ttg aag caa caa caa aga gec tat ggc gat att gtg tac 336 
Leu Glu Ser Leu. Lys Gin Gin Gin Arg Ala Tyr Gly Asp He Val Tyr 
10O 105 110 

20 aac ttt aga ttc ttg gat gat aga gaa egg ctg gac att ttt ccc gaa 38 4 

Asn Phe Arg Phe Leu Asp Asp Arg Glu Arg Leu Asp He Phe Pro Glu 

115 120 125 

cca tea aag cac tgc att cac tac ace gec tac gca tea gaa ggt aac 432 
Pro Ser Lys His Cys He His Tyr Thr Ala Tyr Ala Ser. Glu Gly Asn 
25 130 135 140 

aag tac gtg cct tat ttg aag aat ttg ctg ctt gag caa aaa ate gag 
Lys Tyr Val Pro Tyr Leu Lys Asn Leu Leu Leu Glu Gin Lys He Glu 
145 150 155 160 

ttc aag caa caa. gaa gtg acg agt ttg gac gca gtc gee gac get ggt 
30 Phe Lys Gin Gin Glu Val Thr Ser Leu Asp Ala Val Ala Asp Ala Gly 

r 165 170 175 

tac gat gtt att: gta aac tgc gca ggc ttg tac ggt gga aag ttg get 57 6 

Tyr Asp Val He Val Asn Cys Ala Gly Leu Tyr Gly Gly Lys Leu Ala 
180 185 190 

35 ggt gat gac gat act tgc tac ccc att aga gga gtc att ttg gaa gtt 624 
Gly Asp Asp Asp Thr Cys Tyr Pro He Arg Gly Val He Leu Glu Val 

195 200 205 

gat gca cca tgcj cac aag cac ttc aat tat cga gac ttt act act ttc 672 
Asp Ala Pro Trp His Lys His Phe Asn Tyr Arg Asp Phe Thr Thr Phe 
40 210 215 220 

aca att cca aaa. gag cac age gtg gtg gtt ggg tec acc aag cag gac 720 
Thr He Pro Lys Glu His Ser Val Val Val Gly Ser Thr Lys Gin Asp 
225 230 235 240 

aat cga tgg gat ttg gag ate acc gac gag gat' aga aat gat att ttg ' 7 68 

45 Asn Arg Trp Asp Leu Glu He Thr Asp Glu Asp Arg Asn Asp He Leu 

245 250 255 

aaa cga tac att get tta cat cct gga atg aga gag cca aag att ate 816 



480 



528 
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Lys Arg Tyr lie Ala Leu His Pro Gly Met Arg Glu Pro Lys lie lie 

260 265 270 

aaa gaa tgg tea gca ctt cgc ccg gga cgt aag cat gtc aga att gaa 8 64 

Lys Glu Trp Ser Ala Leu Arg Pro Gly Arg Lys His Val Arg lie Glu 
5 275 280 285 

gcg cag aag agg aca tct gtt gga aac tea aaa gat tat atg gtt gtg 912 
Ala Gin Lys Arg Thr Ser Val Gly Asn Ser Lys Asp Tyr Met Val Val 

290 295 300 

cat cac tat ggt cac ggg age aac gga ttc acg ttg ggt tgg gga aca 960 
10 His His Tyr Gly His Gly Ser Asn Gly Phe Thr Leu Gly Trp Gly Thr 
305 310 315 320 

gca att gaa gca act aaa ctt gtt aag act gca eta gga tta taa 1005 
Ala lie Glu Ala Thr Lys Leu Val Lys Thr Ala Leu Gly Leu 
325 330 

15 

<210> 4 
<211> 334 
<212> PRT 

<213> Caenorhabditis elegans 

20 

<400> 4 

Met Ala Asn He He Pro Lys lie Ala He lie Gly Glu Gly Val He 
1 5 10 15 

25 Gly Cys Thr Ser AJ_a Leu Gin lie Ser Lys Ala He Pro Asn Ala Lys 
20 25 30 

He Thr Val Leu His Asp Lys Pro Phe Lys Lys Ser Cys Ser Ala Gly 
35 • 40 45 



30 



45 



Pro Ala Gly Leu Phe Arg lie Asp Tyr Glu Glu Asn Thr Glu Tyr Gly 
50 55 60 



Arg Ala Ser Phe Ala Trp Phe Ser His Leu Tyr Arg Thr Thr Lys Gly 
35 65 70 75 80 

Ser Glu Thr Gly Val Lys Leu Val Ser Gly His He Gin Ser Asp Asn 
85 90 95 

40 Leu Glu Ser Leu Lys Gin Gin Gin Arg Ala Tyr Gly Asp He Val Tyr 
100 105 110 

Asn Phe Arg Phe Leu Asp Asp Arg Glu Arg Leu Asp He Phe Pro Glu 
115 ' 120 * 125 



Pro Ser Lys His Cys He His Tyr Thr Ala Tyr Ala Ser Glu Gly Asn 
130 135 140 
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Lys Tyr Val Pro Tyr Leu Lys Asn Leu Leu Leu Glu Gin Lys lie Glu 
145 150 155 160 

5 Phe Lys Gin Gin Glu Val Th.r Ser Leu Asp Ala Val Ala Asp Ala Gly 

165 170 175 ' 

Tyr Asp Val lie Val Asn Cys Ala Gly Leu Tyr Gly Gly Lys Leu Ala 
180 185 190 

10 

Gly Asp Asp Asp Thr Cys Tyr Pro lie Arg Gly Val lie Leu Glu Val 
195 200 205 

Asp Ala Pro Trp His Lys H±s Phe Asn Tyr Arg Asp Phe Thr Thr Phe 
15 210 215 220 

Thr lie Pro Lys Glu His Ser Val Val Val Gly Ser Thr Lys Gin Asp 
225 230 235 240 

20 Asn Arg Trp Asp Leu Glu He Thr Asp Glu Asp Arg Asn Asp He Leu 

245 250 255 

Lys Arg Tyr He Ala Leu H±s Pro Gly Met Arg Glu Pro Lys He He 
260 265 270 

25 

Lys Glu Trp Ser Ala Leu Arg Pro Gly Arg Lys His Val Arg He Glu 
275 280 285 

Ala Gin Lys Arg Thr Ser Val Gly Asn Ser Lys Asp Tyr Met Val Val 
30 290 295 300 

His His Tyr Gly His Gly Ser Asn Gly Phe Thr Leu Gly Trp Gly Thr 
305 310 315 320 

35 Ala He Glu Ala Thr Lys Leu Val Lys Thr Ala Leu Gly Leu 

325 330 

<210> 5 
<211> 1186 . 
40 <212> DNA 

<213> Nectria haematococca 

<220> 

<221> " CDS 
45 <222> (42) . . (1124) 

<223> coding for DAAO 
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<400> 5 

agcgacttga atttagcgaa aagaacttgt caaccacaat c atg tec aac aca ate 5 6 

5 Met Ser Asn Thr lie 

1 .5 

gtc gtc gtt ggt gec ggt gtc att ggc ttg acg teg gec ttg ttg etc 104 
Val Val Val Gly Ala Gly Val He Gly Leu Thr Ser Ala Leu Leu Leu 
10 15 20 

10 tec aag aac aag ggc aac aag ate acc gtc gtg gee aag cac atg ccc 152 
Ser Lys Asn Lys Gly Asn Lys He Thr Val Val Ala Lys His Met Pro 

25 30 35 

ggc gac tat gac gtt gaa tac gec teg cct ttt get ggt gec aac cac 200 
Gly Asp Tyr Asp Val Glu Tyr Ala Ser Pro Phe Ala Gly Ala Asn His 

15 40 45 50 

tec ccc atg gcg acg gaa gag age age gaa tgg gaa cgt cgc act tgg , 248 
Ser Pro Met Ala Thr Glu Glu Ser Ser Glu Trp Glu Arg Arg Thr Trp 

55 60 65 

tac gag ttt aag aga ctg gtc gag gag gtc cct gag gee ggt gtt cat 296 

20 Tyr Glu Phe Lys Arg Leu Val Glu Glu Val Pro Glu Ala Gly Val His 
70 75 80 ■ 85 

ttc cag aag tct cgc ate cag agg cgc aat gtg gac act gaa aag gcg 34 4 

Phe Gin Lys Ser Arg He Gin Arg Arg Asn Val Asp Thr Glu Lys Ala 
90 95 100 

25 cag agg tct ggt ttc cca gac gec etc ttc teg aaa gaa ccc tgg ttc 392 
Gin Arg Ser Gly Phe Pro Asp Ala Leu Phe Ser Lys Glu Pro Trp Phe 

105 110 115 

aag aac atg ttt gag gac ttc egt gag cag cac cct age gag gtc ate 4 40 

Lys Asn Met Phe Glu Asp Phe Arg Glu Gin His Pro Ser Glu Val He 

30 120 125 130 

ccc ggt tac gac tct ggc tgc gag ttc aca teg gtg tge ate aac acg 4 88 

Pro Gly Tyr Asp Ser Gly Cys Glu Phe Thr Ser Val Cys He Asn Thr 

135 140 145 

gee ate tac etc ccc tgg etc etc ggc cag tgc ate aag aat ggc gtc . 536 

35 Ala He Tyr Leu Pro Trp Leu Leu Gly Gin Cys He Lys Asn Gly Val 
150 155 i60 165 

ate gtc aag cgc gee ate etc aac gac att age gag gec aag aag ctg 584 
He Val Lys Arg Ala He Leu Asn Asp He Ser Glu Ala Lys Lys Leu 
170 175 180 

40 age cac gcg ggc aag acg ccc aat ate ate gtc aac gee acg ggt etc 632 
Ser His Ala Gly Lys Thr Pro Asn He He Val Asn Ala Thr Gly Leu 

185 190 195 

ggc tec tac aag ctg ggc ggt gtc gag gac aag acc atg gcg cct gcg 680 
Gly Ser Tyr Lys Leu Gly Gly Val Glu Asp Lys Thr Met Ala Pro Ala 

45 200 205 • 210 

egg gga cag att gtg gtt gtg cgc aac gag age age ccc atg etc etc 728 
Arg Gly Gin He Val Val Val Arg Asn Glu Ser Ser Pro Met Leu Leu 
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215 220 225 

act tea ggt gtc gag gac ggc ggt get gat gtc atg tac ttg atg cag 77 6 

Thr Ser Gly Val Glu Asp Gly Gly Ala Asp Val Met Tyr Leu Met Gin 
230 235 - 240 245 

5 cga gca get ggc ggt ggc acc ate ctg ggc ggt ace tac gac gtt ggc 824 
Arg Ala Ala Gly Gly Gly Thr lie Leu Gly Gly Thr Tyr Asp Val Gly 

250 255 260 

aac tgg gag tct cag cca gac ccc aac ate gcg aat cgc ate atg cag 872 
Asn Trp Glu Ser Gin Pro Asp Pro Asn lie Ala Asn Arg lie Met Gin 
10 265 270 275 

cgc ate gtc gag gtg egg ccc gag att gec aac ggc aag ggc gtc aag 920 
Arg lie Val Glu Val Arg Pro Glu lie Ala. Asn Gly Lys Gly Val Lys 

280 285 290 

ggg ctg age gtg ate cga cac gee gtc ggc atg egg ccg tgg cga aag 968 
15 Gly Leu Ser Val lie Arg His Ala Val Gly Met Arg Pro Trp Arg Lys 
295 300 305 

gac gga gtc agg ate gag gag gag aag ctg gat gat gag act tgg ate 1016 
Asp Gly Val Arg lie Glu Glu Glu Lys Leu. Asp Asp Glu Thr Trp lie 
310 315 • 320 325 ■ 

20 gtg cac aac tac gga cac tct gga tgg ggt tac cag ggt teg tat ggt 1064 
Val His Asn Tyr Gly His Ser Gly Trp Gly Tyr Gin Gly Ser Tyr Gly 

330 335 340 

tgt get gag aat gta gtc cag ttg gtt gac aag gtc ggc aag gcg gee 1112 
Cys Ala Glu Asn Val Val Gin Leu Val Asp Lys Val Gly Lys Ala Ala 
25 345 350 355 

aag tct aag ctg tagttgaaaa ggcctgaatg agtaatagta attggatatt 1164 
Lys Ser Lys Leu 
360 

ggaaataccg tatttgeect eg 118 6 

30 

<210> 6 
<211> 361 
<212> PRT 

<213> Nectria haematococca 

35 

<400> 6 

Met Ser Asn Thr lie Val Val Val Gly Ala. Gly Val He Gly Leu Thr 
1 ' 5 10 15 

40 Ser Ala Leu Leu Leu Ser Lys Asn Lys Gly Asn Lys He Thr Val Val 
20 25 30 

Ala Lys His Met Pro Gly Asp Tyr Asp VaX Glu Tyr Ala Ser Pro Phe 
35 ' 40 ' 45 ' 



45 



Ala Gly Ala Asn His Ser Pro Met Ala Thr: Glu Glu Ser Ser Glu Trp 
50 55 ' 60 
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Glu Arg Arg Thr Trp Tyr Glu Phe Lys Arg Leu Val Glu Glu Val Pro 
65 70 75 80 

5 Glu Ala Gly Val His Phe Gin Lys Ser Arg lie Gin Arg Arg Asn Val 

85 90 95 

Asp Thr Glu Lys Ala Gin Arg Ser Gly Phe Pro Asjp Ala Leu Phe Ser 
100 105 110 

10 

Lys Glu Pro Trp Phe Lys Asn Met Phe Glu Asp Phe Arg Glu Gin His 
115 120 125 

Pro Ser Glu Val lie' Pro Gly Tyr Asp Ser Gly Cys Glu Phe Thr Ser 
15 130 135 14 0 

Val Cys lie Asn Thr Ala lie Tyr Leu Pro Trp Lexi Leu Gly Gin Cys 
145 150 155 160 

20 lie Lys Asn Gly Val He Val Lys Arg Ala He Leu Asn Asp He Ser 

165 170 175 

Glu Ala Lys Lys Leu Ser His. Ala Gly Lys Thr Pro Asn He He Val 
180 ' 185 190 

25 

Asn Ala Thr Gly Leu Gly Ser Tyr Lys Leu Gly Gly Val Glu Asp Lys 
195 200 205 

Thr Met Ala Pro Ala 'Arg Gly Gin He Val- Val Val Arg Asn Glu Ser 
30 210 215 22 0 

Ser Pro Met -Leu Leu Thr Ser Gly Val Glu Asp Gly Gly Ala Asp Val 
225 230 235 240 

35 Met Tyr Leu Met Gin Arg Ala Ala Gly Gly Gly Thr He Leu Gly Gly 

245 250 255 

Thr Tyr Asp Val Gly Asn Trp Glu Ser Gin Pro Asp Pro Asn He Ala 
260 265 270 

40 

Asn Arg He Met Gin Arg He Val Glu Val Arg Poro Glu He Ala Asn 
275 280 285 

Gly Lys Gly Val Lys Gly Leu Ser Val He Arg His Ala Val Gly Met 
45 290 295 300 



Arg Pro Trp Arg Lys Asp Gly Val Arg He Glu Glu Glu Lys Leu Asp 
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315 
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320 



Asp Glu Thr Trp lie Val His Asn Tyr Gly His Ser Gly Trp Gly Tyr 
325 330 335 

5 

Gin Gly Ser Tyr Gly Cys Ala Glu Asn Val Val Gin Leu Val Asp Lys 
340 345 350 

Val Gly Lys Ala Ala Lys Ser Lys Leu 
10 355 360 

<210> 7 

<211> 1071 

<212> DNA 

15 <213> Trigonopsis variabilis 

<220> 

<221> CDS 

<222> (1)..{1068) 
20 <223> 

<400> 7 

atg get aaa ate gtt gtt att ggt gec ggt gtt gec ggt tta act aca 4 8 

Met Ala Lys lie Val Val lie Gly Ala Gly Val Ala Gly Leu Thr Thr 

25 1 5 10 15 

get ctt caa ctt ctt cgt aaa gga cat gag gtt aca att gtg tec gag 96 
Ala Leu Gin Leu Leu Arg Lys Gly His Glu Val Thr He Val Ser Glu 

20 . 25 30 

ttt acg ccc ggt gat ctt agt ate gga tat ace teg cct tgg gca ggt 14 4 

30 Phe Thr Pro Gly Asp Leu Ser He Gly Tyr Thr Ser Pro Trp Ala- Gly 
35 40 45 

gec aac tgg etc aca ttt tac gat gga ggc aag tta gee gac tac gat '• 192 
Ala Asn Trp Leu Thr Phe . Tyr Asp Gly Gly .Lys Leu Ala Asp Tyr Asp 
50 55 60 

35 gec gtc tct tat cct ate ttg cga gag ctg get cga age age ccc gag 24 0 

Ala Val Ser Tyr Pro He Leu Arg Glu Leu Ala Arg Ser Ser Pro Glu 
65 70 75 80 

get gga att cga etc ate age caa cgc tec cat gtt etc aag cgt gat 288 
Ala Gly lie Arg Leu He Ser Gin Arg Ser His Val Leu Lys Arg Asp 

40 85 90 95 

ctt cct aaa ctg gaa gtt gee atg teg gec ate tgt caa cgc aat ccc 336 
Leu Pro Lys Leu Glu Val Ala Met Ser Ala He Cys Gin Arg Asn Pro 

100 105 110 

tgg ttc aaa aac aca gtc gat tct ttc'gag att ate gag gac agg tec 384 

45 Trp Phe Lys Asn Thr Val Asp Ser Phe Glu He He Glu Asp Arg Ser 
115 120 125 

agg att gtc cac gat gat gtg get tat eta gtc gaa ttt cgt tec gtt 432 
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Arg lie Val His Asp Asp Val Ala Tyr Leu Val Glu Phe Arg Ser Yal 

130 135 140 

tgt ate cac acc gga gtc tac ttg aac tgg ctg atg tec caa tgc ~tta 4 80 

Cys lie His Thr Gly Val Tyr Leu Asn Trp Leu Met Ser Gin Cys leu 

5 145 150 155 160 

teg etc ggc gee acg gtg gtt aaa cgt cga gtg aac cat ate aag <gat 528 

Ser Leu Gly Ala Thr Val Val Lys Arg Arg Val Asn His lie Lys Asp 

165 170 175 

gee aat tta eta cac tec tea gga tea cgc ccc gac gtg att gtc aac 57 6 

10 Ala Asn Leu Leu His Ser Ser Gly Ser Arg Pro Asp Val lie Val --Asn 
180 185 190 

tgt agt ggt etc ttt gee egg ttc ttg gga ggc gtc gag gac aag aag 624 

Cys Ser' Gly Leu Phe Ala Arg Phe Leu Gly Gly Val Glu Asp Lys lys 
195 200 205 

15 atg tac cct att cga gga caa gtc gtc ctt gtt cga aac tct ctt cct 672 

Met Tyr Pro lie Arg Gly Gin Val Val Leu Val Arg Asn Ser Leu Pro 

210 215 220 

ttt atg gee tec ttt tec age act cct gaa aaa gaa aat gaa gac gaa 720 

Phe Met Ala Ser Phe Ser Ser Thr Pro Glu Lys Glu Asn Glu Asp Glu 

20 225 230 235 240 

get eta tat ate atg acc cga ttc gat ggt act tct ate att ggc ggt 7 68 

Ala Leu Tyr lie Met Thr Arg Phe Asp Gly Thr Ser lie lie Gly Gly 

245 250 255 

tgt ttc caa ccc. aac aac tgg tea tec gaa ccc gat cct tct etc acc 816 

25 Cys Phe Gin Pro Asn Asn Trp Ser Ser Glu Pro Asp Pro Ser Leu Thr 
260 265 270 

cat cga ate ctg tct aga gee etc gac cga ttc ccg gaa ctg acc aaa 8 64 

His Arg lie Leu Ser Arg Ala Leu Asp Arg Phe Pro Glu Leu Thr Lys 
275 ■ 280 • • 285 

30 gat ggc cct ctt gac att gtg cgc gaa tgc gtt ggc- cac cgt cct ggt 912 

Asp Gly Pro Leu Asp lie Val Arg Glu Cys Val Gly His Arg Pro Gly 

290 295 ... 300 

aga gag ggc ggt ccc cga gta gaa tta gag aag ate ccc ggc gtt ggc 960 

Arg Glu Gly Gly Pro Arg Val Glu Leu Glu Lys lie Pro Gly Val Gly 

35 305 310 315- 320 

ttt gtt gtc cat aac tat ggt gee gee ggt get ggt tac caa tec tct 1008 

Phe Val Val His Asn Tyr Gly Ala Ala Gly Ala Gly Tyr Gin Ser Ser 

325 330 335 

tac ggc atg get gat gaa get gtt tct tac .gtc gaa aga get ctt act 1056 
40 Tyr Gly Met Ala Asp Glu Ala Val Ser. Tyr Val Glu Arg Ala Leu Thr 
340 345 350 
cgt cca aac ctt tag 1071 
Arg Pro Asn Leu 
355 



45 



<210> 8 
<211> 356 
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<212> PRT 

<213> Trigonopsis variabilis 
<400> 8 

5 Met Ala Lys lie Val Val lie Gly Ala Gly Val Ala Gly Leu Thr Thr 
15 10 15 

Ala Leu Gin Leu Leu Arg Lys Gly His Glu Val Thr lie Val Ser Glu 
20 25 30 

10 

Phe Thr Pro Gly Asp Leu Ser lie Gly Tyr Thr Ser Pro Trp Ala Gly 
35 - 40 45 

Ala Asn Trp Leu Thr Phe Tyr Asp Gly Gly Lys Leu Ala Asp Tyr Asp 
15 50 55 60 

Ala Val Ser Tyr Pro lie Leu Arg Glu Leu Ala Arg Ser Ser Pro Glu 
65 70 75 80 

20 Ala Gly lie Arg Leu lie Ser Gin Arg Ser His Val Leu Lys Arg Asp 

85 90 95 

Leu Pro Lys Leu Glu Val Ala Met Ser Aia lie Cys Gin Arg Asn Pro 
100 105 110 

25 

Trp Phe Lys Asn Thr Val Asp Ser Phe Glu lie lie Glu Asp Arg Ser 
115 120 125 

Arg lie Val His Asp Asp Val Ala Tyr Leu Val Glu Phe Arg Ser Val 
30 130 135 140 

Cys lie His Thr Gly Val Tyr Leu Asn Trp Leu Met Ser Gin Cys Leu 
145 150 155 160 

35 Ser Leu Gly Ala Thr Val Val Lys Arg Arg Val Asn His lie Lys Asp 

165 170 175 

Ala Asn Leu Leu His Ser Ser Gly Ser Arg Pro Asp Val lie Val Asn 
180 185 190 

40 

Cys Ser Gly Leu Phe Ala Arg Phe Leu Gly Gly Val Glu Asp Lys Lys 
195 . 200 205 

' Met Tyr Pro lie Arg Gly Gin Val Val Leu Val Arg Asn Ser Leu Pro 
45 210 215 220 



Phe Met Ala Ser Phe Ser Ser Thr Pro Glu Lys Glu Asn Glu Asp Glu 
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225 230 235 240 

Ala Leu Tyr lie Met Thr Arg Phe Asp Gly Thr Ser lie lie Gly Gly 
245 250 255 

5 

Cys Phe Gin Pro Asn Asn Trp Ser Ser Glu Pro Asp Pro Ser Leu Thr 
260 265 270 

His Arg lie Leu Ser Arg Ala Leu Asp Arg Phe Pro Glu Leu Thr Lys 
10 275 280 285 ■ 

Asp Gly Pro Leu Asp lie Val Arg Glu Cys Val Gly His Arg Pro Gly 
290 295 300 

15 Arg Glu Gly Gly Pro Arg Val Glu Leu Glu Lys lie Pro Gly Val Gly 
305 310 315 320 

Phe Val Val His Asn Tyr Gly Ala Ala Gly Ala Gly Tyr Gin Ser Ser 
325 • 330 335 

20 

Tyr Gly Met Ala Asp Glu Ala Val Ser Tyr Val Glu Arg Ala Leu Thr 
340 345 - 350 

Arg Pro Asn Leu 
25 ' 355 

<210> 9 

<211> 1047 

<212> DNA ■ . 

30 <213> Schizosaccharomyces pombe. 

<220> 

<221> CDS 

<222> (22) . . (1041) 

35 <223> coding for DAAO 

<400> 9 

atgactaagg aaaataagcc a aga gat att gtc ate gtt ggc get ggc gtt 51 

Arg Asp lie Val He Val Gly Ala Gly Val 

40 1 5 10 

att gga ttg acc act get tgg att ctt tea gac ttg ggt ctt get cct 99 

He Gly Leu Thr Thr Ala Trp He Leu Ser Asp Leu Gly Leu Ala Pro 

15 20 25 

cgt att aag gtg att gec aag tat acg cct gaa gat cgt tct gta gaa' 147 

45 Arg lie Lys Val He Ala Lys Tyr Thr Pro Glu Asp Arg Ser Val Glu 

30 35 40 

tac act tec cct tgg get ggc gca aat ttc tgt age att tct get act 195 
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Tyr Thr Ser Pro Trp Ala Gly Ala Asn Phe Cys Ser He Ser Ala Thr 

45 50 55 

gat gac aat get ttg cgc tgg gat aaa ate act tac cat cgt ttc gec 243 

Asp Asp Asn Ala Leu Arg Trp Asp Lys He Thr Tyr His Arg Phe Ala 
5 60 65 7 0 

tac ttg gcg aaa act cgt cct gaa gca gga ate cgt ttt get gat ctt 291 

Tyr Leu Ala Lys Thr Arg Pro Glu Ala Gly He Arg Phe Ala Asp Leu 

75 80 85 90 

cga gaa ttg tgg gag tac gag ccg aaa cac gac aaa ate aga tec tgg 339 

10 Arg Glu Leu Trp Glu Tyr Glu Pro Lys His Asp Lys He Arg Ser Trp ■ 

95 100 105 

aat ace tat gtc aga gat ttc aaa gtt ate cct gaa aaa gat ctt cca 387 

Asn Thr Tyr Val Arg Asp Phe Lys Val He Pro Glu Lys Asp Leu Pro 
110 115 120 

15 gga gaa tgt ate tac gga cat aag gee ace ace ttt tta ate aac get 435 

Gly Glu Cys He Tyr Gly His Lys Ala Thr Thr Phe Leu He Asn Ala 

125 130 135 

cct cat tac ttg aat tat atg tac aag ctg etc att gaa get ggc gtc 4-83 

Pro His Tyr Leu Asn Tyr Met Tyr Lys Leu Leu He Glu Ala Gly Val 
20 140 145 150 

gaa ttt gaa aag aaa gaa ttg agt cac ate aaa gag act gtc gaa gaa 531 

Glu Phe Glu" Lys Lys Glu Leu Ser His He Lys Glu Thr Val Glu Glu 

155 160 165 170 

act cca gaa get tea gta gta ttt aat tgc act ggt etc tgg get tec 57 9 

25 Thr Pro Glu Ala Ser Val Val Phe Asn Cys Thr Gly Leu Trp Ala Ser 

175 180 185 

aaa ttg ggt ggc gtt gaa gac ccg gac gtt tat ccg act cgt gga cat 627 

Lys Leu Gly Gly Val Glu Asp Pro Asp Val Tyr Pro Thr Arg Gly His 
190' 195 • 200 

30 gtt gtt ttg gtt aag get cct cat- gta • aca gaa act cgc att ttg aat 675 

Val Val Leu Val Lys Ala Pro His Val Thr Glu Thr Arg He Leu Asn 

205 210 215 

ggc aag aac tct gat acc tat att att cct cgt ccc tta aat ggt gga 723 

Gly Lys Asn Ser Asp Thr Tyr He He Pro Arg Pro Leu Asn Gly Gly 
35 220 225 230 

gtc att tgc ggc ggt ttc atg caa cca gga aac tgg gat cgt gaa att 771 
Val lie Cys Gly Gly Phe Met Gin Pro Gly Asn Trp Asp Arg Glu He 

235 240 245 250 

cac cct gaa gac act ttg gat ate ctt aag aga aca teg get ttg atg 819 

40 His Pro Glu Asp Thr Leu Asp He Leu Lys Arg Thr Ser Ala Leu Met 

255 260 265 

cca gaa ttg ttc cac ggc aag ggt ccg gag ggt get gaa att att caa 8 67 

Pro Glu Leu Phe His Gly Lys Gly Pro Glu Gly Ala Glu He He Gin 
• ' 270 275 280 

45 gaa tgt gtc gga ttc cgt cct tct cga aag ggt ggt gee cgc gta gag 915 

Glu Cys Val Gly Phe Arg Pro Ser Arg Lys Gly Gly Ala Arg Val Glu 
285 290 295 



10 
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ctt gat gtt gtt ccc ggc acc tea gtc ccc ctt gtt cat gat tac ggt 963 
Leu Asp Val Val Pro Gly Thr Ser Val Pro Leu Val His Asp Tyr Gly . 

300 305 310 

get tct ggc aca gga tac caa get ggt tat ggt atg get ctt gac tct 1011 
Ala Ser Gly Thr Gly Tyr Gin Ala Gly Tyr Gly Met Ala Leu Asp Ser 
315 320 325 330 

gtc atg ttg get ctt cct aaa ate aaa ttg gcttag 1047 
Val Met Leu Ala Leu Pro Lys lie Lys Leu 
335 340 

<210> 10 
<211> 340 
<212> PRT 

<213>< Schizosaccharomyces pombe 
<400> 10 

Arg Asp lie Val He Val Gly Ala Gly Val lie Gly Leu Thr Thr Ala 
15 10 15 

20 Trp lie Leu Ser Asp Leu Gly Leu Ala Pro Arg He Lys Val He Ala 
20 25 . 30 

Lys Tyr Thr Pro Glu Asp Arg Ser Val Glu Tyr Thr Ser Pro Trp Ala 
35 40 45 



15 



25 



40 



Gly Ala Asn Phe Cys Ser He Ser Ala Thr Asp Asp Asn Ala Leu Arg 
50 55 60 



Trp Asp Lys He Thr Tyr His Arg Phe Ala Tyr Leu Ala Lys Thr Arg 
30 65 70 75 80 

Pro Glu Ala Gly He Arg Phe Ala Asp Leu Arg Glu Leu Trp Glu Tyr 
85 90 95 

35 Glu Pro Lys His Asp Lys He Arg Ser Trp Asn Thr Tyr Val Arg Asp 
100 105 110 

Phe Lys Val lie Pro Glu Lys Asp Leu Pro Gly Glu Cys He Tyr Gly 
115 . 120 125 



His Lys Ala Thr Thr Phe Leu He Asn Ala Pro His Tyr Leu Asn Tyr 
130 135 140 



Met Tyr Lys Leu Leu He Glu Ala Gly Val Glu Phe Glu Lys Lys Glu 
45 145 150 155 160 



Leu Ser His lie Lys Glu Thr Val Glu Glu Thr Pro Glu Ala Ser Val 
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165 



17/34 
170 



PCT/EP2005/002735 
IL75 



Val Phe Asn Cys Thr Gly Leu Trp Ala Ser Lys Leu Gly Gly Val Glu 
180 185 190 

5 

Asp Pro Asp Val Tyr Pro Thr Arg Gly His Val Val Leu Val Lys Ala 
195 200 205 

Pro His Val Thr Glu Thr Arg lie Leu Asn Gly Lys Asn Ser ^sp Thr 
10 210 215 220 

Tyr lie lie Pro Arg Pro Leu Asn Gly Gly Val lie Cys Gly Gly Phe 
225 230 235 240 

15 Met Gin Pro Gly Asn Trp Asp Arg Glu lie His Pro Glu Asp Thr Leu 

245 250 255 

Asp He Leu Lys Arg Thr Ser Ala Leu Met Pro Glu Leu Phe IHis Gly 
260 265 270 

20 

Lys Gly Pro Glu Gly Ala Glu He He Gin Glu Cys Val Gly The Arg 
275 280 ' 285 

Pro Ser Arg Lys Gly Gly Ala Arg Val Glu Leu Asp Val Val Pro Gly 
25 290 295 300 

Thr Ser Val Pro Leu Val His Asp Tyr Gly Ala Ser Gly Thr Gly Tyr 
305 310 315 320 

30 Gin Ala Gly Tyr Gly Met Ala Leu Asp Ser Val Met Leu Ala Leu Pro 

325 330 335 

Lys lie Lys Leu 
340 

35 

<210> 11 
<211> 963 
<212> DNA 

<213> Streptomyces coelicolor 

40 

<220> 

<221> CDS 

<222> (31).. (957) 

<223> 'coding for DAAO 

45 

<220> 

<221> misc feature 
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<222> (880) . . (936) 
<223> DAAO signature 

<400> 11 

5 gtggaaaccg aactggatga cgagcgggat ggc gaa gtc gtc gtg gtc ggc ggc 54 

Gly Glu Val Val Val Val Gly Gly 
1 5 

ggg gtg ate ggg ctg acg acg gec gtc gtc etc gec gag egg ggc aga 102 
Gly Val lie Gly Leu Thr Thr Ala Val Val Leu Ala Glu Arg Gly Arg 

10 10 15 20 

egg gtg egg ctg tgg acc egg gag ccc gcg gag egg ace ace teg gtg 150 

Arg Val Arg Leu Trp Thr Arg Glu Pro Ala Glu Arg Thr Thr Ser Val 

25 30 -35 40 

gta gcg ggc ggg ctg tgg tgg ccg tac egg ate gag ccg gtc gcg ctg 198 

15 Val Ala Gly Gly Leu Trp Trp Pro Tyr Arg lie Glu Pro Val Ala Leu 

45 50 55 

gec cag gee tgg gcg ctg cgt tec ctg gac gtg tac gag gag ctg gcg 24 6 

Ala Gin Ala Trp Ala Leu Arg Ser Leu Asp Val Tyr Glu Glu Leu Ala 
60 65 70 

20 gca egg ccc ggg cag acc ggc gta cgc atg etc gaa ggg gtg etc ggc 294 
Ala Arg Pro Gly Gin Thr Gly Val Arg Met Leu Glu Gly Val Leu Gly 

75 80 85 

gag acc ggc ctg gac gag gtg gac .ggg tgg gee gcg gee egg ctg ccg 342 
Glu Thr Gly Leu Asp Glu Val Asp Gly Trp Ala Ala Ala Arg Leu Pro 

25 90 95 100 

ggg ctg cgc gcg gcg age gee gec gag tac gee ggg acg ggg ctg tgg 390 

Gly Leu Arg Ala Ala Ser Ala Ala Glu Tyr Ala Gly Thr Gly Leu Trp 

105 110 115 120 

gcg egg ctg ccg etc ate gac atg teg acc cat ctg ccg tgg ctg egg 438 

30 Ala Arg Leu Pro Leu lie Asp Met Ser Thr His Leu Pro Trp Leu Arg 

125 130 135 

gag egg ctg ctg gee gcg ggc ggc acg gtg gag gac cgc gcg gtg acc 48 6 

Glu Arg Leu Leu Ala Ala Gly Gly Thr Val Glu Asp Arg Ala Val Thr 
140 145 150 

35 gat ctg gee gag gcg gac gcg ccg gtg gtg gtc aac tgc acc ggc ctg 534 
Asp Leu Ala Glu Ala Asp Ala Pro Val Val Val Asn Cys Thr Gly Leu 

155 160 165 

ggc gee egg gag ctg gtg ccg gac ccg gcg gta egg ccg gtg cgc gga 582 
Gly Ala Arg Glu Leu Val Pro Asp Pro Ala Val Arg Pro Val Arg Gly 

40 170 175 180 

cag ctg gtc gtc gtg gag aac ccc ggc ate cac aac tgg ctg gtc gcg 630 

Gin Leu Val Val Val Glu Asn Pro Gly lie His Asn Trp Leu Val Ala 

185 190 195 200 

gee gac gcg .gac tec ggg gag acg acg tac ttc ctt ccg cag ccg gga 678 

45 Ala Asp Ala Asp Ser Gly Glu Thr Thr Tyr Phe Leu Pro Gin Pro Gly 

205 210 215 

egg etc ctg ctg ggc ggc acg get gag gag gac gec tgg teg acc gag 726 



WO 2005/090582 

Arg Leu Leu Leu 
220 

ccg gac ccg gag 
Pro Asp Pro Glu 
235 

cgt ccc gag ate 
Arg Pro Glu lie 
250 

egg ccg gec egg 
Arg Pro Ala Arg 
265 

<3<3<3 cgc egg ctg 
Gly Arg Arg Leu 



15 gtg gee tgg ggc 
Val Ala Trp Gly 
300 



10 



19/34 

Gly Gly Thr Ala Glu Glu Asp Ala Trp 
225 

gtc gcg gcg gec ate gtg cga egg tgc 
Val Ala Ala Ala lie Val Arg Arg Cys 
240 245 
gee gga gcg egg gtg etc gcg cac ctg 
Ala Gly Ala Arg Val Leu Ala His Leu 



gac gcg 
Asp Ala 
270 
gtg cac 
Val His 
285 

tgc get 
Cys Ala 



gcg 
Ala 
255 
gtc 
Val 

aac 
Asn 

cag 
Gin 



egg ctg gag cgc 
Arg Leu Glu Arg 
275 

tac ggt cac ggc 
Tyr Gly His Gly 
290 

gag gcg gec egg 
Glu Ala Ala Arg 
305 



cac 
His 
260 

ggg 

Gly 



acg 
Thr 



ggc gcg 
Gly Ala 

etc gec 
Leu Ala 
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Ser Thr Glu 
230 

gcg gec ctg 774 
Ala Ala Leu 

gtg ggg ctg 822 
Val Gly Leu 

ctg ccg gac 870 
Leu Pro Asp 
280 

ggc gtc acc 918 
Gly Val Thr 
295 

tcctga 963 



<210> 12 

20 <211> 309 

<212> PRT 

<213> Streptornyces coelicolor 
<220> 

25 <221> mis cofeature 

<222> (880) . . (936) 

<223> DAAO signature 

<400> 12 

30 Gly Glu Val Val Val Val Gly Gly Gly Val lie Gly Leu Thr- Thr Ala 
15 10 15 

Val Val Leu Ala Glu Arg Gly Arg Arg Val Arg Leu Trp Thr Arg Glu 
20 25 30 



35 



Pro Ala Glu Arg Thr Thr Ser Val Val Ala Gly Gly Leu Trp Trp Pro 
35 40 45 



40 



Tyr Arg He Glu Pro Val Ala Leu Ala Gin Ala Trp Ala Leu Arg Ser 
50 55 60 



Leu Asp Val Tyr Glu Glu Leu Ala Ala Arg Pro Gly Gin Thr Gly Val 
65 70 75 80 



45 Arg Met Leu Glu Gly Val Leu Gly Glu Thr Gly Leu Asp Glu Val Asp 

85 90 95 
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Gly Trp Ala Ala Ala Arg Leu Pro Gly Leu Arg Ala Ala Ser Ala Ala 
100 105 110 

Glu Tyr Ala Gly Thr Gly Leu Trp Ala Arg Leu Pro Leu lie Asp Met 
5 115 120 125 

Ser Thr His Leu Pro Trp Leu Arg Glu Arg Leu Leu Ala Ala Gly Gly 
130 135 140 

10 Thr Val Glu Asp Arg Ala Val Thr Asp Leu Ala Glu Ala Asp Ala Pro 
145 150 155 160 

Val Val Val Asn Cys Thr Gly Leu Gly Ala Arg Glu Leu Val Pro Asp 
165 170 175 

15 

Pro Ala Val Arg Pro Val Arg Gly Gin Leu Val Val Val Glu Asn Pro 
180 185 190 

Gly lie His Asn Trp Leu Val Ala Ala Asp Ala Asp Ser Gly Glu Thr 
20 195 200 205 

Thr Tyr Phe Leu Pro Gin Pro Gly Arg Leu Leu Leu Gly Gly Thr Ala 
210 215 . 220 

25 Glu Glu Asp Ala Trp Ser Thr Glu Pro Asp Pro Glu Val Ala Ala Ala 
225 230 235 240 

lie Val Arg Arg Cys Ala Ala Leu Arg Pro Glu lie Ala Gly Ala Arg 
245 250 255 

30 

Val Leu Ala His Leu Val Gly Leu Arg Pro Ala Arg Asp Ala Val Arg 
260 265 270 

Leu Glu Arg Gly Thr Leu Pro Asp Gly Arg Arg Leu Val His Asn Tyr 
35 275 280 285 

Gly His Gly Gly Ala Gly Val Thr Val Ala Trp Gly Cys Ala Gin Glu 
290 295 300 

40 Ala Ala Arg Leu Ala 
305 

<210> 13 
<211> 1038 
45 <212> DNA 

<213> Candida boidinii 
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<220> 

<221> CDS 

<222> (1)..(1035) 

<223> coding for DAAO 

5 

<400> 13 

atg ggt gat caa att gtt gtt ctt ggt tec ggt att att ggt tta tat 4 8 

Met Gly Asp Gin He Val Val Leu Gly Ser Gly He He Gly Leu Tyr 
15 10 15 

10 act aca tac tgt tta ate tat gag get gga tgt get cca get aaa att 96 
Thr Thr Tyr Cys Leu He Tyr Glu Ala Gly Cys Ala Pro Ala Lys He 

20 25 30 

act att gtt get gaa ttt tta cca ggt gat caa tct aca tta tat aca 14 4 

Thr He Val Ala Glu Phe Leu Pro Gly Asp Gin Ser Thr Leu Tyr Thr 

15 35 40 45 

tct cca tgg gca ggt ggt aat ttt tct tgt att tea cca get gat gat 192 

Ser Pro Trp Ala Gly Gly Asn Phe Ser Cys He Ser Pro Ala Asp Asp 

50 55 60 

aca aca ttg get tat gat aaa ttc aca tat ctt aat tta ttc aag att 240 

20 Thr Thr Leu Ala Tyr Asp Lys Phe Thr Tyr Leu Asn Leu Phe Lys He 

65 70 75 80 

cac aaa aaa tta ggt gga cca gaa tgt gga tta gat . aat aag cca agt 28 8 

His Lys Lys Leu Gly Gly Pro Glu Cys Gly Leu Asp Asn Lys Pro Ser 
85 90 95 

25 act gaa tat tgg gat ttt tat cct . ggt gat gaa aaa gtc aat tct tta 336 

Thr Glu Tyr Trp Asp Phe Tyr Pro Gly Asp Glu Lys Val Asn Ser Leu 

100 ■ 105 110 

aaa caa tat ctt aaa gat ttt aaa gtt att cca aaa tea gaa tta cca 38 4 

Lys Gin Tyr Leu Lys Asp Phe Lys Val He Pro Lys Ser Glu Leu Pro 

30 115 120 . 125 

gaa ggt gtt gaa tat ggt att agt tat act aca tgg aat ttc aac tgt 432 

Glu Gly Val Glu Tyr Gly He Ser Tyr Thr Thr Trp Asn Phe Asn Cys 

130 135 140 

cct gtt ttc tta caa aat atg get aat ttt tta aat aaa aga aat gtt 4 80 

35 Pro Val Phe Leu Gin Asn Met Ala Asn Phe Leu Asn Lys Arg Asn Val 

145 150 155 160 

acc att att aga aaa cat tta aca cat att tct caa get tat tta aca 528 

Thr He He Arg Lys His Leu Thr His He Ser Gin Ala Tyr Leu Thr 
165 170 175 

40 gtt aat aca aaa gtt gtt ttc aac tgt aca ggt att ggt get get gat 57 6 

Val Asn Thr Lys Val Val Phe Asn Cys Thr Gly He Gly Ala Ala Asp 

180 185 190 

tta ggt ggt gtt aaa gat gaa aaa gtt tat cca act aga- gga caa gtt 624 

Leu Gly Gly Val Lys Asp Glu Lys Val Tyr Pro Thr Arg Gly Gin Val 

45 195- 200 205 

gtt gtt gtt aga get cca cat att caa gaa aat aaa atg aga tgg ggt 672 

Val Val Val Arg Ala Pro His lie Gin Glu Asn Lys Met Arg Trp Gly 
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210 215 220 

aaa gac tat get act tat att att cca aga cca tat tct aat ggt gaa 720 
Lys Asp Tyr Ala Thr Tyr lie lie Pro Arg Pro- Tyr Ser Asn Gly Glu 
225 230 235 240 

5 tta gtc tta ggt ggt ttc tta caa aag gat aat tgg aca ggt aat act 7 68 

Leu Val Leu Gly Gly Phe Leu Gin Lys Asp Asn Trp Thr Gly Asn Thr 

245 250 255 

ttt ggt ttt gaa act gat gat att gtt agt aga act aca tct tta tta 816 
Phe Gly Phe Glu Thr Asp Asp lie Val Ser Arg Thr Thr Ser Leu Leu 
10 260 265 270 

cca aag att tta gat gaa cca ctt cat att att aga gtt gca get ggt 8 64 

Pro Lys lie Leu Asp Glu Pro Leu His He He Arg Val Ala Ala Gly 

275 280 285 

tta aga cca agt aga cat ggt ggt cca aga att gaa get gaa gtt tgt 912 
15 Leu Arg Pro Ser Arg His Gly Gly Pro Arg He Glu Ala Glu Val Cys 
290 295 300 

gaa gaa ggt aaa tta act att cat aat tat ggt get tct gga tat ggt 960 
Glu Glu Gly Lys Leu Thr He His Asn Tyr Gly Ala Ser Gly Tyr Gly 
305 310 315 320 

20 tat caa get ggt tat ggt atg tct tat gaa get gtc aaa ctt tta gtt 1008 
Tyr Gin Ala Gly Tyr Gly Met Ser Tyr Glu Ala Val Lys Leu Leu Val 

325 330 335 

gat aac caa aaa gtt" aaa get aaa ctt tag 
Asp Asn Gin Lys Val Lys Ala Lys Leu 
25 340 345 

<210> 14 

<211> 345 

<212> PRT 

30 <213> Candida boidinii 

<400> 14 

Met Gly Asp Gin He Val Val Leu Gly Ser Gly He He Gly Leu Tyr 
15 10 15 



35 



Thr Thr Tyr Cys Leu He Tyr Glu Ala Gly Cys Ala Pro Ala Lys He 
20 25 30 



Thr He Val Ala Glu Phe Leu Pro Gly Asp Gin Ser Thr Leu Tyr Thr 

40 35 40 45 

Ser Pro Trp Ala Gly Gly Asn Phe Ser Cys He Ser Pro Ala Asp Asp 
50 55 60 

45 Thr Thr Leu Ala Tyr Asp Lys Phe Thr Tyr Leu Asn Leu Phe Lys He 
65 70 75 80 



1038 
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His Lys Lys Leu Gly Gly Pro Glu Cys Gly Leu Asp Asn Lys Pro Ser 
85 90 95 

Thr Glu Tyr Trp Asp Phe Tyr Pro Gly Asp Glu Lys Val Asn Ser Leu 
5 100* 105 110 

Lys Gin Tyr Leu Lys Asp Phe Lys Val lie Pro Lys Ser Glu Leu Pro 
115 120 125 

10 Glu Gly Val Glu Tyr Gly lie Ser Tyr Thr Thr Trp Asn Phe Asn Gys 
130 135 140 

Pro Val Phe Leu Gin Asn Met Ala Asn Phe Leu Asn Lys Arg Asn Val 
145 150 155 160 

15 

Thr He lie Arg Lys His Leu Thr His He Ser Gin Ala Tyr Leu Thr 
165 170 175 

Val Asn Thr Lys Val Val Phe Asn Cys Thr Gly lie Gly Ala Ala Asp 
20 180 185 190 

Leu Gly Gly Val Lys Asp Glu Lys Val Tyr Pro Thr Arg Gly Gin Val 
195 200 205 

25 Val Val Val Arg Ala Pro His He Gin Glu Asn Lys Met Arg Trp Gly 
210 215 220 

Lys Asp Tyr Ala Thr Tyr He He Pro Arg Pro Tyr Ser Asn Gly Glu 
225 230 235 240 

30 

Leu Val Leu Gly Gly Phe Leu Gin Lys Asp Asn Trp Thr Gly Asn Thr 
245 250 255 

Phe Gly Phe Glu Thr Asp Asp He Val Ser Arg Thr Thr Ser Leu Leu 
35 260 265 270 

Pro Lys He Leu Asp Glu Pro Leu His He He Arg Val Ala Ala Gly 
275 280 285 

40 Leu Arg Pro Ser Arg His Gly Gly Pro Arg He Glu Ala Glu Val Cys 
290 295 300 

Glu Glu Gly Lys Leu Thr He His Asn Tyr Gly Ala Ser Gly Tyr Gly 
"305 31'0 315 320' 

45 

Tyr Gin Ala Gly Tyr Gly Met Ser Tyr Glu Ala Val Lys Leu Leu Val 
325 330 335 
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Asp Asn Gin Lys Val Lys Ala Lys Leu 
340 345 

<210> 15 
<211> 12334 
<212> DNA 

<213> STPT GUS Nit-P daao 



10 



<220> 
<221> 
<222> 
<223> 



mis cofeature 
(38) . . (183) 

Agrobacterium Right Border 



15 <220> 

<221> terminator 

<222> (384) . . (639) 

<223> complementary: NOS terminator 



20 <220> 
<221> 
<222> 
<223> 



mis cofeature 
(716) . . (1822) 
complementary : 



coding for Rhodotorula gracilis DAAO 



25 <220> 
<221> 
<222> 
<223> 



promoter 
(1866) . . (3677) 

complementary: A.thaliana nitrilase I promoter 



30 



<220> 
<221> 
<222> 
<223> 



terminator 
(3767) . . (3971) 
complementary : 



35S terminator 



35 



<220> 
<221> 
<222> 
<223> 



mis cofeature 
(4046) . . (6043) 

complementary: coding for beta-glucuronidase (comprising intron) 



40 



<220> 
<221> 
<222> 
<223> 



promoter 
(6097) . . (7414) 
complementary : 



sTPT promoter 



45 



<220> 
<221> 



misc feature 
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<222> (7486) . . (7702) 

<223> Agrobacterium Left Border 

<400> 15 
5 aatattcaaa caaacacata cagcgcgact 
aaaccttttc acgccctttt aaatatccga 
tttacccgcc aatatatcct gtcaaacact 
atcagatctg gtaccttgcg ccgggtaccc 
ttaccctgtt atccctagtc gagcggccgc 

10 ttttagatca gcacactggc ggccgttact 
aacgactcag agcttgacag gaggcccgat 
atttatccta gtttgcgcgc tatattttgt 
ggactctaat cataaaaacc catctcataa 
catgcttaac gtaattcaac agaaattata 

15 tcaatcttaa gaaactttat tgccaaatgt 
gcgggaactc cacgaaaata tccgaacgca 
acttcgactc ccgcgccgcg ccgtggtacc 
catcctccgc cgcgccccaa ctctgctggt 
caagcgtgac ctccttctcc ttcgccgctc 

20 actttgtccg gtcgagaggc aggacgatcc 
gtgcaggtcg caagccgacg t.tgtggcgga 
tcgagatggt cgggtcgagg cgcaagcagt 
tgacagacaa gtcccagtct cccacgccgt 
gtcggggaat gatgtaggcg ggagaagcgg 

25 atggggactt gacgaggacg gtttgcccgc 
ccgcaatcga cttggcgcca agtcccgtag 
cctgctcaag cgacgtaacg gtccgtctct 
ttgcaaggta ctggcagtac tttggtgcgt 
tagcgccagg tggacattcg gaagatggga 

30 accagtgccc gagcaagccg tcttcgttct 
tggcatggcc cgtcgggacc aactcgaccc 
cttgtcgagg accgtctgta agcgtcatga 
aagcgaaagt ctggctcgag acgtcctccg 
agcccttccg agcgaggatg agggcgctgc 

35 cgacgcgctt ctgcgagtgc atgggccctc 
-ctcgagtctt tgttttttac tttggttcat 
tagacttttt tttgtttttt ttttgtggtc 
aagatcgaac ctgtggtgga aatgaaacaa 
ctgcattgat ttgtttttgt tttttttgta 

40 aagaaaaata aaatctattg attgattaaa 
aagaagtgaa aaccccgttt aagaattaca 
aaatccaaac gaaataaaac aaaatggaga 
aaaagaattt attccatttt ttttcccgta 
catatttgat gcattttgat tcctcataat 

45 gtttctaatt cgtattaacc tacctttttt 
acggaattga tgttttggac cacttagaaa 
tttcattaaa tataaattta aataagaaat 



tatcatggac 


atacaaatgg 


acgaacggat 


60 


ttattctaat 


aaacgctctt 


ttctcttagg 


120 


gatagtttaa 


actgaaggcg 


ggaaacgaca 


180 


caaactgtct 


cacgacgttt 


tgaacccaga 


240 


cagtgtgatg 


gatatctgca 


gaattcgccc 


300 


agtggatcaa 


ttcactggcc 


gtcgttttac 


360 


ctagtaacat 


agatgacacc 


gcgcgcgata 


420 


tttctatcgc 


gtattaaatg tataattgcg 


480 


ataacgtcat 


gcattacatg ttaattatta 


540 


tgataatcat 


cgcaagaccg 


gcaacaggat 


600 


ttgaacgatc 


ggggatcatc 


cgggtctgtg 


660 


gcaagatcta 


gagcttgggt 


cccgcctaca 


720 


gctggaacgc 


ctcgtcgacg 


agctgcgcga 


780 


atcccgcact 


cgagaagcca 


tacgcatgca 


840 


gtgcgctgcc 


cctgccgagc 


gagaggggcg 


900 


gttctgcctc 


aacgcggggt 


ccgcctcgtc 


960 


ggacctcgat 


gccttcgatc 


gttccgtcgc 


1020 


gcttgaggat 


ccgctggacc 


gtctctgggt 


1080 


acgtcccgcc 


gcagatgact 


tcgccacctg 


1140 


ggtcggacga 


gtccatcgtg 


catcgcttgc 


1200 


ggattggctc 


ggcggcttgg 


tcgtcgatgc 


1260 


cgttgaccac 


caaatccgca 


ccgtcgaacg 


1320 


caaacgtcgc 


gccgagcttc 


tgcagctctc 


1380 


ggacggagag 


ggtgtcgtag 


gttacgccga 


1440 


gggggcggta 


atttggcgtg 


atgtccttgt 


1500 


gcgcgaaccg 


cctcgtcccc 


ttgagccaca 


1560 


•acttcttgaa 


agtcgattct 


tcccattttg 


1620 


aaggcgtcca 


attcgcgcca 


gcccatggtg 


1680 


gcaagtcgcg 


cgcgagaata 


tgcacgctgt 


1740 


tcagaccgat 


aacgcctgat 


ccgaggacaa 


1800 


gactagagtc 


gagatccgat 


atcgcccggg 


1860 


gacactcaga 


gacttgagag 


aagcaatata 


1920 


acgtttattt 


tcctattgga 


gacggtaacg 


1980 


ggtgggacta 


gcccacgtgg 


tttcttttct 


2040 


aagttcacat 


caaacctact 


aataattgag 


2100 


ccagccgatg 


ctttatgtct 


gaatataaaa 


2160 


acggtggttt 


acaaagtatt 


tggacacaat 


2220 


actaccaaat 


aaaaaacaaa 


taaaaaactt 


2280 


gaatttattc 


ttttatggat 


tccttaaatc 


2340 


aggtaataat 


atatactatg ttatagatat 


'2400 


tggtcgtacg 


attctaccta 


ataatattga 


2460 


gtattttttt 


tttggtttgt 


cttagctgta 


2520 


gtcataaata 


aaatttgacg 


tatagatttt 


2580 
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ttaaatccat tttatgttat ttaatatttg 
ggatgggtta atactaagtt ttccttaatg 
ctaaaaaata gatcaatgtg ttttggtccg 
cgaaaagaaa gtctcatata gtaaatcata 
5 taacatggat tgtatagttt cttatataac 
caattaataa tcaaacgggc ttggtaggaa 
aaaacacaca ctcaagtctt ttatttcatc 
ttgccacttt tctttacaaa taaacacaaa 
caaacactgt ttgtcttttt tcgatcacgt 

10 gggaatgtaa aacaacaaca tgtgaagtca 
ttttcgttgt caatattagt cattccaaaa 
atattcttgg aactaatgta tgtggaaata' 
caaatatgca aagaaaaaaa agaaaagaac 
attggctaca tggttgtaga atgtagggta 

15 ggatgataga tggataatct ttagatacaa 
aataaagaaa aaaagtttag cataaaagtt 
tccttcatag gagtgacatg aactgacgaa 
ccgctttctc caccttcgaa atgaatgtga 
agtgtaaaac atctggtctc gaggtacctg 

20 acgtcgtgag acagtttggt gcaggtcgaa 
tggttttagg aattagaaat tttattgata. 
agggtctctt atatgctcaa cacatgagcg 
tgggaactac tcacacatta ttctggagaa 
actggtgatt tttgcgccgg gtaccgagct 

25 atggtgcgcc aggagagttg ttgattcatt 
agttcatgcc agtccagcgt ttttgcagca 
gagtgaagat ccctttcttg ttaccgccaa- 
cggcgaaatt ccatacctgt tcaccgacga 
acatatccag ccatgcacac tgatactctt 

30 gcccggctaa cgtatccacg ccgtat'tcgg 
gccaggccag aagttctttt tccagtacct 
cataccatcc gtaataacgg ttcaggcaca 
tgtgagcgtc gcagaacatt acattgacgc 
gcgttgcttc cgccagtggc gaaatattcc 

35 tggcaatact ccacatcacc acgcttgggt 
tcgcctgtaa gtgcgcttgc tgagtttccc 
tcggcttgtt gcccgcttcg aaaccaatgc 
tttcatcaat caccacgatg ccatgttcat 
ggtaatgcga ggtacggtag gagttggccc 

40 ccatcagcac gttatcgaat cctttgccac 
cagtaaagta gaacggtttg tggttaatca 
ggatgccgac gcgaagcggg tagatatcac 
gttcatagag ataaccttca cccggttgcc 
cgctagtgcc ttgtccagtt gcaaccacct 

45 catcaccatt ggccaccacc tgccagtcaa 
gcgtcaccac ggtgatatcg tccacccagg 
gatggattcc ggcatagtta aagaaatcat 
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aaatgtgagt 


ttggctccta 


tttaatctxa 


O £Z a a 


aattatctca 


gagaaactgg 


attaaataaa 


on a a 

z / uu 


gtcaaatatc 


tttggattta 


ctattattgg 


£ I bU 


ttcctacaag 


agaaatcaaa 


atttttgaat 




caattagttc 


gcatcaagaa 


aaccaaaccc 


o o q a 


tatttcattg 


cagctttcag ataaaagaaa 


A Ok A A 

zy 4u 


tttcttactt 


gcaggaactc 


aaattccact 


iOUO 


ttgtcaatga 


aacgaaatag tctttttatg 


r\ r\ 
J U bU 


ttctgattgt 


gacagccatc 


catatatata 


3120 


catatacgta 


atggtttagc 


atagcttcta 


318 0 


catttttaag 


aaaaataaat 


taatatatgt 


o o / a 

324 0 


cagtaactta 


attattaaac 


attctaaatg 


o o r\ r\ 


acaactgaaa 


tcaaagccag 


attcataata 


o o c a 


acacaacatc 


cagaattgaa 


cactcaaatt 


.54 £U 


gagaattggt 


tctcttccat 


tattaacgaa 


J>4 o U 


tgaaactcaa 


cataacattt 


tgaacttgac 




tcacaaccga 


1 _ J _ 1 > - -»- 1 A^. 

ttacttgttt 


gagtcatctt 


*3 a a a 
6 bU U 


ccggtttctt 


cgggtgctca 


tttacggtca 


o c ez a 
obbU 


gtagggataa 


cag : ggtaatc 


tgggttcaaa 




attcgagctc 


ggtacccggt 


cactggattt 


O '7 o n 


gaagtatttt 


acaaatacaa 


at a cat act a 


"3 Q A a 


.aaaccctata 


agaaccctaa 


ttcccttatc 


o q n n 


aaatagagag 


agatagattt 


gtagagagag 


o n (Z a 


cggtagcaat 


tcccgaggct 


gtagccgacg 


a a o a 
4 UzU 


gtttgcctcc 


ctgctgcggt 


ttttcaccga 


4 U o U 


gaaaagccgc 


cgacttcggt 


ttgcggtcgc 


a i a a 
4 X4 U 


cgcgcaatat 


gccttgcgag 


gtcgcaaaat 


/ o a a 
4ZUU 


cggcgctgac 


gcgatcaaag 


acgcggtgat 


/ O £ A 


cactccacat 


gtcggtgtac 


attgagtgca 


4ozU 


tgatgataat 


cggctgatgc 


agtttctcct 


/ O Q A 


tctctgccgt 


ttccaaatcg 


ccgctttgga 


A A A A 

4 4 4 U 


gcacatcaaa 


gagatcgctg 


atggtatcgg 


/ C A A 

4 O U U 


aggtgatcgg acgcgtcggg tcgagtttac 


4 o bU 


cgtgcacttg 


cggacgggta 


tccggttcgt 


a /- O A 

4 bZU 


ggtttttgtc 


acgcgctatc 


agctctttaa 


4 bo U 


cgttgactgc 


ctcttcgctg 


tacagttctt 


4 /4U 


ctaaagagag 


gttaaagccg 


acagcagcag 


A O A A 
4 D UU 


ctgcccagtc 


gagcatctct 


tcagcgtaag 


4 8 d(J 


caatccagtc 


cattaatgcg 


tggtcgtgca 


4 yzU 


gtaagtccgc 


atcttcatga 


cgaccaaagc 


/ADA 

4 y O (J 


ggaactgttc 


gcccttcact 


gccactgacc 


O U 4 U 


actctgtctg 


gcttttggct 


gtgacgcaca 


ci r\A 


agaggtgcgg attcaccact tgcaaagtcc 


5160 


gttgatccgc 


atcacgcagt 


tcaacgctga 


5'220 


cagacgcgtg 


gttacagtct 


tgcgcgacat 


5280 


tgttcggcgt 


ggtgtagagc 


attacgctgc 


5340 


ggaagtaaga 


ctgctttttc ttgccgtttt 


5400 
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cgtcggtaat caccattccc 
cggtgatacc tgcacatcaa 
atacacactt ataaactaca 
aaaatatttg aaatattata 
5 gtagaagcag aaacttacgt 
caaatggcgt atagccgccc 
"cgtaatgagt gaccgcatcg 
taaagacttc gcgctgatac 
actgatcgtt aaaactgcct 

10 accaacgctg accaattcca 
gttttttgat ttcacgggtt 
ctagtggatc cgatatcgcc 
tagtgagagc aaagagggac 
gttgatcgga cggctacgat 

15 gttatggtag ggttaatggt 
aatcttgaga tactggttta 
atggttttta gagacacagt 
tgtataattt tgtttttttg 
cattatttac cttgtatgtt 

20 aagtagattt tttataaatg 
ttgtttatgg ttgtctgtca 
attctaggct tttttgaata 
aaagttatac aactataact 
ttttttgttg tagccagttt 

25 ttatgcgttt aactaggatt 
tttttttttt ttctagttcc 
taatttctgt aattgcgatt 
gaacccgagt ttaaatatga 
tacttaatag tggtgttttt 

30 gaagttgatt gagtgattat 
gtgattattg tattagtatg 
attcatcctt ttgcatatga 
agatatgtgt atttatttaa 
tatataactt gttgaatttg 

35 ggtaccgagc tcgaattcac 
attgtcatta gattgttttt 
atcaaacgga tgttaattca 
agcgtcaatt tgtttacacc 
cggcagctcg gcacaaaatc 

40 gcttgcgtca tgcggtcgct 
ggaacgcatg aaggttatcg 
cgcaacccat ctagcccgcg 
cgatccccag ggcagtgccc 
tgtcggcatc gaccgcccga 

45 cgtagtgatc gacggagcgc 
cgacttcgtg ctgattccgg 
ggtggagctg gttaagcagc 
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ggcgggatag tctgccagtt 
caaattttgg tcatatatta 
gaaaagcaat tgctatatac 
ttactactaa ttaatgataa 
acacttttcc cggcaataac 
tgatgctcca tcacttcctg 
aaacgcagca cgatacgctg 
cagacgttgc ccgcataatt 
ggcacagcaa ttgcccggct 
cagttttcgc gatccagact 
ggggtttcta caggacgtac 
cgggctcgac tctagatgaa 
ggacttatga ggatttcgag 
gatctcgatt tggttaatcc 
caatttcatc taacggtaga 
gattggatga gtgtagggtc 
gaatattagc caatcgaagt 
gaagataata atgattgaaa 
gagtggtctt caaattattg 
aaattttact tattttaaag 
gcatttgact tgcagtttca 
tcttattact tttttcgtaa 
aaaattaggg ttttctacaa 
actcgtaagt tacaaaaaaa 
accatgtact ttcatgtact 
acccaatcta taaaattctg 
taaaattaat attacatgtt 
taaaaatgtt gacccatcac 
ggggataatt- gatgaattaa 
tgtatgtaaa cctatgtgat 
taagcaaaga tgattgttct 
gaaattgtgt tagcgtacgc 
ggtgactttt gttaatgata 
agtataagct atcaggatcc 
tggccgtcgt tttacaacga 
atgcatagat gcactcgaaa 
gtacattaaa gacgtccgca 
acaatatatc ctgccaccag 
accacgcgtc taaaaaggtg 
gcgtatatga tgcgatgagt 
ctgtacttaa ccagaaaggc 
ccctgcaact cgccggggcc 
gcgattgggc ggccgtgcgg 
cgattgaccg cgacgtgaag 
cccaggcggc ggacttggct 
tgcagccaag cccttacgac 
gcattgaggt cacggatgga 
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cagttcgttg 


ttcacacaaa 


5460 


gaaaagttat 


aaattaaaat 


5520 


tacattcttt 


tattttgaaa 


5580 


ttattatata 


tatatcaaag 


5640 


atacggcgtg 


acatcggctt 


5700 


attattgacc 


cacactttgc 


5760 


gcctgcccaa 


cctttcggta 


5820 


acgaatatct 


gcatcggcga 


5880 


ttcttgtaac 


gcgctttccc 


5940 


gaatgcccac 


aggccgtcga 


6000 


catggtcgat 


cgactctaga 


6060 


atcgaaattc 


agagttttga 


6120 


tatttcaaga 


gatggtactt 


6180 


agtatctcgc 


ggtgtatgga 


6240 


gaatgatgta 


attagataag 


6300 


catcttatct 


tgataagtgg 


6360 


tccatatcac 


catcatcatc 


6420 


ttttggtaga 


ttttattttt 


6480 


aacgtgacag 


attcacaaga 


6540 


gtatctctat 


ttaatttctt 


6600 


tgctcatagt 


catatacgtt 


6660 


ta.caatttta 


taattttatc 


6720 


aacaaaaaaa 


tcttctaatt 


6780 


tacaaatgaa 


cccacatgta 


6840 


caattcaccc 


tatactcttt 


6900 


tccatttgac 


caaattcaat 


6960 


cactatttct 


cgatttgagg 


7020 


tacaaatatg 


ttatagttta 


7080 


gtaaacatga 


ttcttcttat 


7140 


tgatgttatt 


ggttgattga 


7200 


tatgaggtaa 


tttgttactc 


7260 


aaaacaatag 


agaacataaa 


7320 


ttgtagtatc 


tatacattta 


7380 


gggggatcct 


ctagagtcga 


7440 


ctcagtactg 


cttggtaata 


7500 


tcagccaatt 


ttagacaagt 


7560 


atgtgttatt 


aagttgtcta 


7620 


ccagccaaca 


gctccccgac 


7680 


atgtgtattt 


gagtaaaaca 


7740 


aaataaacaa 


atacgcaagg 


7800 


gggtcaggca 


agacgaccat 


7860 


gatgttctgt 


tagtcgattc 


7920 


gaagatcaac 


cgctaaccgt 


7980 


gccatcggcc 


ggcgcgactt 


8040 


gtgtccgcga 


tcaaggcagc 


8100 


atatgggcca 


ccgccgacct 


8160 


aggctacaag 


cggcctttgt 


8220 
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cgtgtcgcgg gcgatcaaag 
gtacgagctg cccattcttg 
cgccgccggc acaaccgttc 
ggcgctggcc gctgaaatta 
5 agcaaaagca caaacacgct 
acgttggcca gcctggcaga 
gaggatcaca ccaagctgaa 
ctatctgaat acatcgcgca 
aattttagcg gctaaaggag 

10 ggaatgcccc atgtgtggag 
ccggccctgc aatggcactg 
ccatccggcc cggtacaaat 
gccgcgcagg ccgcccagcg 
aaggggccgc tgatcgaatc 

15 cgattaggaa gccgcccaag 
acgtgggcac ccgcgatagt 
gtgaccgacg agctggcgag 
ccgcagggcc ggccggcatg 
cccatctaac cgaatccatg 

20 tgttccgtcc acacgttgcg 
agaaagacga cctggtagaa 
gtacgaagaa ggccaagaac 
gccgctacaa gatcgtaaag 
ctgattggat gtaccgcgag 

25 ccgattactt tttgatcgat 
ccgcaggcaa ggcagaagcc 
ccggagagtt caagaagttc 
cggagtacga tttgaaggag 
gcaacctgat cgagggcgaa 

30 aa'attgccct agcaggggaa 
ttgggaaccc aaagccgtac 
acattgggaa ccggtcacac 
ccgcctaaaa ctctttaaaa 
tgtctggcca gcgcacagcc 

35 ccctacgccc cgccgcttcg 
agatgcgtaa ggagaaaata 
ctgcgctcgg tcgttcggct 
ttatccacag aatcagggga 
gccaggaacc gtaaaaaggc 

40 gagcatcaca aaaatcgacg 
taccaggcgt ttccccctgg 
accggatacc tgtccgcctt 
tgtaggtatc tcagttcggt 
cccgttcagc' ccgaccgctg 

45 agacacgact tatcgccact 
gtaggcggtg ctacagagtt 
gtatttggta tctgcgctct 
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gcacgcgcat cggcggtgag 
agtcccgtat cacgcagcgc 
ttgaatcaga acccgagggc 
aatcaaaact catttgagtt 
aagtgccggc cgtccgagcg 
cacgccagcc atgaagcggg 
gatgtacgcg gtacgccaag 
gctaccagag taaatgagca 
gcggcatgga aaatcaagaa 
gaacgggcgg ttggccaggc 
gaacccccaa gcccgaggaa 
cggcgcggcg ctgggtgatg 
gcaacgcatc gaggcagaag 
cgcaaagaat cccggcaacc 
ggcgacgagc aaccagattt 
cgcagcatca tggacgtggc 
gtgatccgct acgagcttcc 
gccagtgtgt gggattacga 
aaccgatacc gggaagggaa 
gacgtactca agttctgccg 
acctgcattc ggttaaacac 
ggccgcctgg t.gacggtatc 
agcgaaaccg ggcggccgga 
atcacagaag gcaagaaccc 
cccggcatcg gccgttttct 
agatggttgt tcaagacgat 
tgtttcaccg tgcgcaagct 
gaggcggggc aggctggccc 
gcatccgccg gttcctaatg 
aaaggtcgaa aaggtctctt 
attgggaacc ggaacccgta 
atgtaagtga ctgatataaa 
cttattaaaa ctcttaaaac 
gaagagctgc aaaaagcgcc 
cgtcggccta tcgcggccta 
ccgcatcagg cgctcttccg 
gcggcgagcg gtatcagctc 
taacgcagga aagaacatgt 
cgcgttgctg gcgtttttcc 
ctcaagtcag aggtggcgaa 
aagctccctc gtgcgctctc 
tctcccttcg ggaagcgtgg 
gtaggtcgtt cgctccaagc 
cgccttatc'c ggtaactatc 
ggcagcagcc actggtaaca 
cttgaagtgg tggcctaact 
gctgaagcca gttaccttcg 
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gttgccgagg 


cgctggccgg 


o o o r\ 

828 0 


gtgagctacc 


caggcactgc 


8340 


gacgctgccc 


gcgaggtcca 


8400 


aatgaggtaa 


agagaaaatg 


84 60 


cacgcagcag 


caaggctgca 


8520 


tcaactttca 


gttgccggcg 


8580 


gcaagaccat 


taccgagctg 


8640 


aatgaataaa 


tgagtagatg 


8700 


caaccaggca 


ccgacgccgt 


87 60 


gtaagcggct 


gggttgtctg 


8820 


tcggcgtgag 


cggtcgcaaa 


8880 


acctggtgga 


gaagttgaag 


8940 


acgccccggt 


gaatcgtggc 


9000 


gccggcagcc 


ggtgcgccgt 


c\ r\ c r\ 

90 60 


tttcgttccg 


atgctctatg 


9120 


cgttttccgt 


ctgtcgaagc 


9180 


agacgggcac 


gtagaggttt 


9240 


cctggtactg 


atggcggttt 


9300 


gggagacaag 


cccggccgcg 


9360 


gcgagccgat 


ggcggaaagc 


9420 


cacgcacgtt 


gccatgcagc 


9480 


cgagggtgaa 


gccttgatta 


9540 


gtacatcgag 


atcgagcta'g 


9600 


ggacgtgctg 


acggttcacc 


9660 


ctaccgcctg 


gcacgccgcg 


9720 


ctacgaacgc 


agtggcagcg 


9780 


gatcgggtca 


aatgacctgc 


9840 


gatcctagtc 


atgcgctacc 


9900 


tacggagcag 


atgctagggc 


9960 


tcctgtggat 


agcacgtaca 


10020 


cattgggaac 


ccaaagccgt 


10080 


agagaaaaaa 


ggcgattttt • 


10140 


ccgcctggcc 


tgtgcataac 


10200 


tacccttcgg 


tcgctgcgct 


102 60 


tgcggtgtga 


aataccgcac 


10320 


cttcctcgct 


cactgactcg 


10380 


actcaaaggc 


ggtaatacgg 


10440 


gagcaaaagg 


ccagcaaaag 


10500 


ataggctccg 


cccccctgac 


105 60 


acccgacagg 


actataaaga 


10620 


ctgttccgac 


cctgccgctt 


10680 


cgctttctca 


tagctcacgc 


10740 


tgggctgtgt 


gcacgaaccc 


10800 


gtcttgagtc 


caacccggta 


10860' 


ggattagcag 


agcgaggtat 


10920 


acggctacac 


tagaaggaca 


10980 


gaaaaagagt 


tggtagctct 


11040 
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tgatccggca 
acgcgcagaa 
cagtggaacg 
tgtagggctt 
5 tgtgagcaat 
gtcggcttga 
ttcacgtagt 
tgtccaagat 
tccattgccc 

10 caaatgcggg 
ttccatagcg 
aagagttcct 
aagatagcca 
cgctgccatt 

15 tcgtgcacaa 
gaagtttcca 
accgtaacca 
tacaaatgta 
gatagttgag 

20 gggcgactgc 
gcgtaacgcg 
taacaagcca 



aacaaaccac 
aaaaaggatc 
aaaactcacg 
attatgcacg 
tatgtgctta 
acgaatttct 
ggacaaattc 
aagcctgtct 
agtcggcagc 
acaacgtaag 
ttaaggtttc 
ccgccgctgg 
gatcaatgtc 
ctccaaattg 
caatggtgac 
aaaggtcgtt 
gcaaatcaat 
cggccagcaa 
tcgatacttc 
cctgctgcgt 
cttgctgctt 
tgaaaaccgc 



cgctggtagc 
tcaagaagat 
ttaagggatt 
cttaaaaata 
gtgcatctaa 
agctagacat 
ttccaactga 
agcttcaagt 
gacatccttc 
cactacattt 
atttagcgcc 
acctaccaag 
gatcgtggct 
cagttcgcgc 
ttctacagcg 
gatcaaagct 
atcactgtgt 
cgtcggttcg 
ggcgatcacc 
aacatcgttg 
ggatgcccga 
cactgcgttc 



ggtggttttt 
cctttgatct 
ttggtcatgc 
ataaaagcag 
cgcttgagtt 
tatttgccga 
tctgcgcgcg 
atgacgggct 
ggcgcgattt 
cgctcatcgc 
tcaaatagat 
gcaacgctat 
ggctcgaaga 
ttagctggat 
cggagaatct 
cgccgcgttg 
ggcttcaggc 
agatggcgct 
gcttccccca 
ctgctccata 
ggcatagact 
catg 



ttgtttgcaa 
tttctacggg 
atgatatatc 
acttgacctg 
aagccgcgcc 
ctaccttggt 
aggccaagcg 
gatactgggc 
tgccggttac 
cagcccagtc 
cctgttcagg 
gttctcttgc 
tacctgcaag 
aacgccacgg 
cgctctctcc 
tttcatcaag 
cgccatccac 
cgatgacgcc 
tgatgtttaa 
acatcaaaca 
gtaccccaaa 



gcagcagatt 
gtctgacgct 
tcccaatttg 
atagtttggc 
gcgaagcggc 
gatctcgcct 
atcttcttct 
cggcaggcgc 
tgcgctgtac 
gggcggcgag 
aaccggatca 
ttttgtcagc 
aatgtcattg 
aatgatgtcg 
aggggaagcc 
ccttacggtc 
tgcggagccg 
aactacctct 
ctttgtttta 
tcgacccacg 
aaaacagtca 



11100 
11160 
11220 
11280 
11340 
11400 
11460 
11520 
11580 
11640 
11700 
11760 
11820 
11880 
11940 
12000 
12060 
12120 
12180 
12240 
12300 
12334 



25 <210> 16 

<211> 11385 

<212> DNA 

<213> STPT GUS ptx-P daao 



30 



<220> 
<221> 
<222> 
<223> 



.mis cofeature 
(38) - . (183) 

Agrobacterium Right Border 



35 



<220> 
<221> 
<222> 
<223> 



terminator 
(384) . . (639) 
compl ementary : 



nos terminator 



40 



<220> 
<221> 
<222> 
<223> 



misc_f eature 
(716) . . (1822) 
complementary : 



coding for Rhodotorula gracilis DAAO 



45 <220> 

<221> promoter 
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<222> (1866) . . (2728) 

<223> complementary: ptxA promoter 
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<220> 

<221> terminator 

<222> (2818) . . (3022) 

<223> complementary: 35 S terminator 
<220> 

<221> misc_f eature 

<222> (3097) . . (5094) 

<223> complementary: coding for beta-glucuronidase (comprising intron) 



15 



<220> 
<221> 
<222> 
<223> 



promoter 
(5148) . . (6465) 

complementary: sTPT promoter 



20 



<220> 
<221> 
<222> 
<223> 



mis cofeature 
(6537) . . (6753) 
Agrobacterium .Left Border 



<400> 16 

25 aatattcaaa caaacacata cagcgcgact 

aaaccttttc acgccctttt aaatatccga 

tttacccgcc aatatatcct gtcaaacact 

atcagatctg gtaccttgcg ccgggtaccc 

ttaccctgtt atccctagtc gagcggccgc 

30 ttttagatca gcacactggc ggccgttact 

aacgactcag agcttgacag gaggcccgat 

atttatccta gtttgcgcgc tatattttgt 

ggactctaat cataaaaacc catctcataa 

catgcttaac gtaattcaac agaaattata 

35 tcaatcttaa gaaactttat tgccaaatgt 

gcgggaactc cacgaaaata tccgaacgca 

acttcgactc ccgcgccgcg ccgtggtacc 

catcctccgc cgcgccccaa ctctgctggt 

caagcgtgac ctccttctcc ttcgccgctc 

40 actttgtccg gtcgagaggc aggacgatcc 

gtgcaggtcg caagccgacg ttgtggcgga 

tcgagatggt cgggtcgagg cgcaagcagt 

tgacagacaa gtcccagtct cccacgccgt 

gtcggggaat gatgtaggcg ggagaagcgg 

45 atggggactt gacgaggacg gtttgcccgc 

ccgcaatcga cttggcgcca agtcccgtag 

cctgctcaag cgacgtaacg gtccgtctct 



tatcatggac 


atacaaatgg 


acgaacggat 


60 


ttattctaat 


aaacgctctt 


ttctcttagg 


120 


gatagtttaa 


actgaaggcg 


ggaaacgaca 


180 


caaactgtct 


cacgacgttt 


tgaacccaga 


240 


cagtgtgatg 


gatatctgca 


gaattcgccc 


300 


agtggatcaa 


ttcactggcc 


gtcgttttac 


360 


ctagtaacat 


agatgacacc 


gcgcgcgata 


420 


tttctatcgc 


gtattaaatg 


tataattgcg 


480 


ataacgtcat 


gcattacatg 


ttaattatta 


540 


tgataatcat 


cgcaagaccg 


gcaacaggat 


600 


ttgaacgatc 


ggggatcatc 


cgggtctgtg 


660 


gcaagatcta 


gagcttgggt 


cccgcctaca 


720 


gctggaacgc 


ctcgtcgacg 


agctgcgcga 


780 


atcccgcact 


cgagaagcca 


tacgcatgca 


840 


gtgcgctgcc 


cctgccgagc 


gagaggggcg 


900 


gttctgcctc 


aacgcggggt 


ccgcctcgtc 


960 


ggacctcgat 


gccttcgatc 


gttccgtcgc 


1020 


gcttgaggat 


ccgctggacc 


gtctctgggt 


1080 


acgtcccgcc 


gcagatgact 


tcgccacctg 


1140 


ggtcggacga 


gtccatcgtg 


catcgcttgc 


1200 


ggattggctc 


ggcggcttgg 


tcgtcgatgc 


1260 


cgttgaccac 


caaatccgca 


ccgtcgaacg 


1320 


caaacgtcgc 


gccgagcttc 


tgcagctctc 


1380 
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ttgcaaggta ctggcagtac tttggtgcgt 
tagcgccagg tggacattcg gaagatggga 
accagtgccc gagcaagccg tcttcgttct 
tggcatggcc cgtcgggacc aactcgaccc 
5 cttgtcgagg accgtctgta agcgtcatga 
aagcgaaagt ctggctcgag acgtcctccg 
agcccttccg agcgaggatg agggcgctgc 
cgacgcgctt ctgcgagtgc atgggccctc 
ctcgagtttc gaagatttta gtgtaatgtg 

10 aaaaaataga atgagtgatg aggtttatat 
tgatatatct ttctcgtgag tcatattcac 
tattcatttt ttatgttaaa tcattgattg 
agttagaatg aattcaaaca aactaggcca 
tcttaagaat ttgtttattt ttttcatgga 

15 actagaggac atttttttaa catgtagtga 
aaggggcaat cttaatcttt ttttcataaa 
agactttgtg ctaaactatt ggtaatttct 
tatgtatgaa atgtgtcatc ctaaaaacaa 
tatgcttaat ggaagaagca atatgttgat 

20 gagtatgtaa ttgacaacta tgattttata 
gtgtgtgggt ttcaaagcat gttatgctag 
acattcatcc ttgaatggag ggacaaactt 
atcctccctc agcttcacaa aaaattgcct 
ctgggttcaa aacgtcgtga gacagtttgg 

25 tcactggatt ttggttttag gaattagaaa 
aatacatact aagggtttct tatatgctca 
attcccttat ctgggaacta ctcacacatt 
tgtagagaga gactggtgat ttttgcgccg 
tgtagccgac gahggtgcgc caggagagtt 

30 tttttcaccg aagttcatgc cagtccagcg 
tttgcggtcg cgagtgaaga tccctttctt 
ggtcgcaaaa tcggcgaaat tccatacctg 
gacgcggtga tacatatcca gccatgcaca 
cattgagtgc agcccggcta acgtatccac 

35 cagtttctcc tgccaggcca gaagttcttt 
gccgctttgg acataccatc cgtaataacg 
gatggtatcg gtgtgagcgt cgcagaacat 
gtcgagttta cgcgttgctt ccgccagtgg 
atccggttcg ttggcaatac tccacatcac 

40 cagctcttta atcgcctgta agtgcgcttg 
gtacagttct ttcggcttgt tgcccgcttc 
gacagcagca gtttcatcaa tcaccacgat 
ttcagcgtaa gggtaatgcg aggtacggta 
gtggtcgtgc accatcagca cgttatcgaa 

45 acgaccaaag ccagtaaagt agaacggttt 
tgccactgac cggatgccga cgcgaagcgg 
tgtgacgcac agttcataga gataaccttc 



ggacggagag 


ggtgtcgtag 


gttacgccga 


1440 


gggggcggta 


atttggcgtg 


atgtccttgt 


1500 


gcgcgaaccg 


cctcgtcccc 


ttgagccaca 


1560 


acttcttgaa 


agtcgattct 


tcccattttg 


1620 


aaggcgtcca 


attcgcgcca 


gcccatggtg 


1680 


gcaagtcgcg 


cgcgagaata 


tgcacgctgt 


1740 


tcagaccgat 


aacgcctgat 


ccgaggacaa 


1800 


gactagagtc 


gagatccgat 


atcgcccggg 


' 1860 


tgtgctcact 


actatgaagc 


tttgcactta 


1920 


ggtgaaaaaa 


actatgaaat 


tttgatattt 


1980 


ggaccatgtt 


gcagcaaatt 


ggaattaaac 


2040 


atttttagtg 


ggcctcgtta 


catattcaag 


2100 


gaaaaaagga 


tgtggggcca 


tttttttgtg 


2160 


taaggggaat 


caatggaaaa 


agtttgatgt 


2220 


caagtagtgc 


tattattcga 


cccgtgatga 


2280 


tctgcacatg 


tgatgcttta 


attatgcttt 


2340 


ttttgtaatc 


gaatcaagta 


tcttttaaac 


2400 


cattttgcta 


gttttagact 


ttgatgttta 


2460 


gtttattggg 


taaaagaaag 


ggacttgatt 


2520 


ttggatttga 


tattcctaac 


attaatttaa 


2580 


tgattcttgt 


gtttgatgct 


tgaaaaatct 


2640 


.tgaatgactt 


ttgaataggt 


gtaaaatcca 


2700 


cgaggtacct 


ggtagggata 


acagggtaat 


2760 


tgcaggtcga 


aattcgagct 


cggtacccgg 


2820 


ttttattgat 


agaagtattt 


tacaaataca 


2880 


acacatgagc 


gaaaccctat 


aagaacccta 


2940 


attctggaga 


aaaatagaga 


gagatagatt 


3000 


ggtaccgagc 


tcggtagcaa 


ttcccgaggc 


3060 


gttgattcat 


tgtttgcctc 


cctgctgcgg 


3120 


tttttgcagc 


agaaaagccg 


ccgacttcgg 


3180 


gttaccgcca 


acgcgcaata 


tgccttgcga 


3240 


ttcaccgacg 


acggcgctga 


cgcgatcaaa 


3300 


ctgatactct 


tcactccaca 


tgtcggtgta 


3360 


gccgtattcg 


gtgatgataa 


tcggctgatg 


3420 


ttccagtacc 


ttctctgccg 


tttccaaatc 


3480 


gttcaggcac 


agcacatcaa 


agagatcgct 


3540 


tacattgacg 


caggtgatcg 


gacgcgtcgg 


3600 


cgaaatattc 


ccgtgcactt 


gcggacgggt 


3660 


cacgcttggg 


tggtttttgt 


cacgcgctat 


3720 


ctgagtttcc 


ccgttgactg 


cctcttcgct 


3780 


gaaaccaatg 


cctaaagaga 


ggttaaagcc 


3840 


gccatgttca 


tctgcccagt 


cgagcatctc 


3900 


ggagttggcc 


ccaatccagt 


ccattaatgc 


3960 


tcctttgcca 


cgtaagtccg 


catcttcatg 


4020 


gtggttaatc 


aggaactgtt 


cgcccttcac 


4080 


gtagatatca 


cactctgtct 


ggcttttggc 


4140 


acccggttgc 


cagaggtgcg 


gattcaccac 


4200 
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ttgcaaagtc ccgctagtgc 
ttcaacgctg acatcaccat 
ttgcgcgaca tgcgtcacca 
cattacgctg cgatggattc 
5 cttgccgttt tcgtcggtaa 
gttcacacaa acggtgatac 
taaattaaaa tatacacact 
ttattttgaa aaaaatattt 
atatatcaaa ggtagaagca 

10 gacatcggct tcaaatggcg 
ccacactttg ccgtaatgag 
acctttcggt ataaagactt 
tgcatcggcg aactgatcgt 
cgcgctttcc caccaacgct 

15 caggccgtcg agttttttga 
tcgactctag actagtggat 
cagagttttg atagtgagag 
agatggtact tgttgatcgg 
cggtgtatgg agttatggta 

20 aattagataa gaatcttgag 
ttgataagtg gatggttttt 
ccatcatcat ctgtataatt 
attttatttt tcattattta 
gattcacaag aaagtagatt 
.25 tttaatttct tttgtttatg 
tcatatacgt tattctaggc 
ataattttat caaagttata 
atcttctaat' tttttttgtt 
ac.ccacatgt attatgcgtt 

30 ctatactctt tttttttttt 
ccaaattcaa ttaatttctg 
tcgatttgag ggaacccgag 
gttatagttt atacttaata 
attcttctta tgaagttgat 

35 tggttgattg agtgattatt 
atttgttact cattcatcct 
gagaacataa aagatatgtg 
ctatacattt atatataact 
tctagagtcg aggtaccgag 

40 gcttggtaat aattgtcatt 
tttagacaag tatcaaacgg 
taagttgtct aagcgtcaat 
agctccccga ccggcagctc 
tgagtaaaac agcttgc'gtc 

45 aatacgcaag gggaacgcat 
aagacgacca tcgcaaccca 
ttagtcgatt ccgatcccca 



32/34 

cttgtccagt tgcaaccacc 
tggccaccac ctgccagtca 
cggtgatatc gtccacccag 
cggcatagtt aaagaaatca 
tcaccattcc cggcgggata 
ctgcacatca acaaattttg 
tataaactac agaaaagcaa 
gaaatattat attactacta 
gaaacttacg tacacttttc 
tatagccgcc ctgatgctcc 
tgaccgcatc gaaacgcagc 
cgcgctgata ccagacgttg 
taaaactgcc tggcacagca 
gaccaattcc acagttttcg 
tttcacgggt tggggtttct 
ccgatatcgc ccgggctcga 
caaagaggga cggacttatg 
acggctacga tgatctcgat 
gggttaatgg tcaatttcat 
atactggttt agattggatg 
agagacacag tgaatattag 
ttgttttttt ggaagataat 
ccttgtatgt tgagtggtct 
ttttataaat gaaattttac 
gttgtctgtc agcatttgac 
ttttttgaat atcttattac 
caactataac taaaattagg 
gtagccagtt tactcgtaag 
taactaggat taccatgtac 
tttctagttc cacccaatct 
taattgcgat ttaaaattaa 
tttaaatatg ataaaaatgt 
gtggtgtttt tggggataat 
tgagtgatta ttgtatgtaa 
gtattagtat gtaagcaaag 
tttgcatatg agaaattgtg 
tatttattta aggtgacttt 
tgttgaattt gagtataagc 
ctcgaattca ctggccgtcg 
agattgtttt tatgcataga 
atgttaattc agtacattaa 
ttgtttacac cacaatatat 
ggcacaaaat caccacgcgt 
atgcggtcgc tgcgtatatg 
gaaggttatc gctgtactta 
tctagcccgc gccctgcaac 
gggcagtgcc cgcgattggg 
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tgttgatccg 


catcacgcag 


4260 


acagacgcgt 


ggttacagtc 


4320 


gtgttcggcg 


tggtgtagag 


4380 


tggaagtaag 


actgcttttt 


4440 


gtctgccagt 


tcagttcgtt 


4500 


gtcatatatt 


agaaaagtta 


4560 


ttgctatata 


ctacattctt 


4620 


attaatgata 


attattatat 


4680 


ccggcaataa 


catacggcgt 


4740 


atcacttcct 


gattattgac 


4800 


acgatacgct 


ggcctgccca 


4860 


cccgcataat 


tacgaatatc 


4920 


attgcccggc 


tttcttgtaa 


4980 


cgatccagac 


tgaatgccca 


5040 


acaggacgta 


ccatggtcga 


5100 


ctctagatga 


aatcgaaatt 


5160 


aggatttcga 


gtatttcaag 


5220 


ttggttaatc 


cagtatctcg 


5280 


ctaacggtag 


agaatgatgt 


5340 


agtgtagggt 


ccatcttatc 


5400 


ccaatcgaag 


ttccatatca 


5460 


aatgattgaa 


attttggtag 


5520 


tcaaattatt gaacgtgaca 


5580 


ttattttaaa 


ggtatctcta 


5640 


ttgcagtttc 


atgctcatag 


5700 


ttttttcgta 


atacaatttt 


5760 


gttttctaca 


aaacaaaaaa 


5820 


ttacaaaaaa 


atacaaatga 


5880 


tttcatgtac 


tcaattcacc 


5940 


ataaaattct 


gtccatttga 


6000 


tattacatgt- 


tcactatttc 


6060 


tgacccatca 


ctacaaatat 


6120 


tgatgaatta 


agtaaacatg 


6180 


acctatgtga 


ttgatgttat 


6240 


atgattgttc 


ttatgaggta 


6300 


ttagcgtacg 


caaaacaata 


6360 


tgttaatgat 


attgtagtat 


6420 


tatcaggatc 


cgggggatcc 


6480 


ttttacaacg 


actcagtact 


6540 


tgcactcgaa 


atcagccaat 


6600 


agacgtccgc 


aatgtgttat 


6660 


cctgccacca 


gccagccaac 


6720 


ctaaaaaggt 


gatgtgtatt 


6780 


atgcgatgag 


taaataaaca 


6840 


accagaaagg 


cgggtcaggc 


6900 


tcgccggggc 


cgatgttctg 


6960 


cggccgtgcg 


ggaagatcaa 


7020 



WO 2005/090582 



PCT/EP2005/002735 



ccgctaaccg ttgtcggcat cgaccgcccg 
cggcgcgact tcgtagtgat cgacggagcg 
atcaaggcag ccgacttcgt gctgattccg 
accgccgacc tggtggagct ggttaagcag 
5 gcggcctttg tcgtgtcgcg ggcgatcaaa 
gcgctggccg ggtacgagct gcccattctt 
ccaggcactg ccgccgccgg cacaaccgtt 
cgcgaggtcc aggcgctggc cgctgaaatt 
aagagaaaat gagcaaaagc acaaacacgc 

10 gcaaggctgc aacgttggcc agcctggcag 
agttgccggc ggaggatcac accaagctga 
ttaccgagct gctatctgaa tacatcgcgc 
atgagtagat gaattttagc ggctaaagga 
accgacgccg tggaatgccc catgtgtgga 

15 tgggttgtct gccggccctg caatggcact 
gcggtcgcaa accatccggc ccggtacaaa 
agaagttgaa ggccgcgcag gccgcccagc 
tgaatcgtgg caaggggccg ctgatcgaat 
cggtgcgccg tcgattagga agccgcccaa 

20 gatgctctat gacgtgggca cccgcgatag 
tctgtcgaag cgtgaccgac gagctggcga 
cgtagaggtt tccgcagggc cggccggcat 
gatggcggtt tcccatctaa ccgaatccat 
gcccggccgc gtgttccgtc cacacgttgc 

25 tggcggaaag cagaaagacg acctggtaga 
tgccatgcag cgtacgaaga aggccaagaa 
agccttgatt agccgctaca agatcgtaaa 
gatcgagcta gctgattgga tgtaccgcga 
gacggttcac cccgattact ttttgatcga 

30 ggcacgccgc gccgcaggca aggcagaagc 
cagtggcagc gccggagagt tcaagaagtt 
aaatgacctg ccggagtacg atttgaagga 
catgcgctac cgcaacctga tcgagggcga 
gatgctaggg caaattgccc tagcagggga 

35 tagcacgtac attgggaacc caaagccgta 
cccaaagccg tacattggga accggtcaca 
aggcgatttt tccgcctaaa actctttaaa 
ctgtgcataa ctgtctggcc agcgcacagc 
gtcgctgcgc tccctacgcc ccgccgcttc 

40 aaataccgca cagatgcgta aggagaaaat 
tcactgactc gctgcgctcg gtcgttcggc 
cggtaatacg gttatccaca gaatcagggg 
gccagcaaaa ggccaggaac cgtaaaaagg 
'gcccccctga cgagcatcac aaaaatcgac 

45 gactataaag ataccaggcg tttccccctg 
ccctgccgct taccggatac ctgtccgcct 
atagctcacg ctgtaggtat ctcagttcgg 



acgattgacc 


gcgacgtgaa 


ggccatcggc 


7 080 


ccccaggcgg 


cggacttggc 


tgtgtccgcg 


714 0 


gtgcagccaa 


gcccttacga 


catatgggcc 


7200 


cgcattgagg 


tcacggatgg 


aaggctacaa 


lorn 

72 60 


ggcacgcgca 


tcggcggtga 


ggttgccgag 


7320 


gagtcccgta 


tcacgcagcg 


cgtgagctac 


7380 


cttgaatcag 


aacccgaggg 


cgacgctgcc 


744 0 


aaatcaaaac 


tcatttgagt 


taatgaggta 


7500 


taagtgccgg 


ccgtccgagc 


gcacgcagca 


7560 


acacgccagc 


catgaagcgg 


gtcaactttc 


7 620 


agatgtacgc 


ggtacgccaa 


ggcaagacca 


7 680 


agctaccaga 


gtaaatgagc 


aaatgaataa 


7740 


ggcggcatgg 


aaaatcaaga 


acaaccaggc 


n O n n 

7 8 00 


ggaacgggcg 


gttggccagg 


cgtaagcggc 


78 60 


ggaaccccca 


agcccgagga 


atcggcgtga 


7920 


tcggcgcggc 


gctgggtgat 


gacctggtgg 


7980 


ggcaacgcat 


cgaggcagaa 


gacgccccgg 


804 0 


ccgcaaagaa 


tcccggcaac 


cgccggcagc 


8100 


gggcgacgag 


caaccagatt 


ttttcgttcc 


8160 


tcgcagcatc 


atggacgtgg 


ccgttttccg 


8220 


ggtgatccgc 


tacgagcttc 


cagacgggca 


8280 


ggccagtgtg 


tgggattacg 


acctggtact 


8340 


gaaccgatac 


cgggaaggga 


agggagacaa 


8400 


ggacgtactc 


aagttctgcc 


ggcgagccga 


84 60 


aacctgcatt 


cggttaaaca 


ccacgcacgt 


8520 


cggccgcctg 


gtgacggtat 


ccgagggtga 


8580 


gagcgaaacc 


gggcggccgg 


agtacatcga ■ 


8640 


gatcacagaa 


ggcaagaacc 


cggacgtgct 


87 00 


tcccggcatc 


ggccgttttc 


tctaccgcct 


87 60 


cagatggttg 


ttcaagacga 


tctacgaacg 


8820 


ctgtttcacc 


gtgcgcaagc 


tgatcgggtc 


8880 


ggaggcgggg 


caggctggcc 


cgatcctagt 


8940 


agcatccgcc 


ggttcctaat 


gtacggagca 


9000 


aaaaggtcga 


aaaggtctct 


ttcctgtgga 


9060 


cattgggaac 


cggaacccgt 


acattgggaa 


9120 


catgtaagtg 


actgatataa 


aagagaaaaa 


9180 


acttattaaa 


actcttaaaa 


cccgcctggc 


9240 


cgaagagctg 


caaaaagcgc 


ctacccttcg 


9300 


gcgtcggcct 


atcgcggcct 


atgcggtgtg 


9360 


accgcatcag 


gcgctcttcc 


gcttcctcgc 


9420 


tgcggcgagc 


ggtatcagct 


cactcaaagg 


9480 


ataacgcagg 


aaagaacatg 


tgagcaaaag 


9540 


ccgcgttgct 


ggcgtttttc 


cataggctcc 


9600 


gctcaagtca 


gaggtggcga 


aacccgacag 


9660 


gaagctccct 


cgtgcgctct 


cctgttccga 


9720 


ttctcccttc 


gggaagcgtg 


gcgctttctc 


9780 


tgtaggtcgt 


tcgctccaag 


ctgggctgtg 


9840 
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tgcacgaacc ccccgttcag 
ccaacccggt aagacacgac 
gagcgaggta tgtaggcggt 
ctagaaggac agtatttggt 
5 ttggtagctc ttgatccggc 
agcagcagat tacgcgcaga 
ggtctgacgc tcagtggaac 
ctcccaattt gtgtagggct 
gatagtttgg ctgtgagcaa 

10 cgcgaagcgg cgtcggcttg 
tgatctcgcc tttcacgtag 
gatcttcttc ttgtccaaga 
ccggcaggcg ctccattgcc 
ctgcgctgta ccaaatgcgg 

15 cgggcggcga gttccatagc 
gaaccggatc aaagagttcc 
cttttgtcag caagatagcc 
gaatgtcatt gcgctgccat 
gaatgatgtc gtcgtgcaca 

20 caggggaagc cgaagtttcc 
'gccttacggt caccgtaacc 
ctgcggagcc gtacaaatgt 
caactacctc tgatagttga 
actttgtttt agggcgactg 

25 atcgacccac ggcgtaacgc 
aaaaacagtc ataacaagcc 



34/34 

cccgaccgct gcgccttatc 
ttatcgccac tggcagcagc 
gctacagagt tcttgaagtg 
atctgcgctc tgctgaagcc 
aaacaaacca ccgctggtag 
aaaaaaggat ctcaagaaga 
gaaaactcac gttaagggat 
tattatgcac gcttaaaaat 
ttatgtgctt agtgcatcta 
aacgaatttc tagctagaca 
tggacaaatt cttccaactg 
taagcctgtc tagcttcaag 
cagtcggcag cgacatcctt 
gacaacgtaa gcactacatt 
gttaaggttt catttagcgc 
tccgccgctg gacctaccaa 
agatcaatgt cgatcgtggc 
tctccaaatt gcagttcgcg 
acaatggtga cttctacagc 
aaaaggtcgt tgatcaaagc 
agcaaatcaa tatcactgtg 
acggccagca acgtcggttc 
gtcgatactt cggcgatcac 
ccctgctgcg taacatcgtt 
gcttgctgct tggatgcccg 
atgaaaaccg ccactgcgtt 
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cggtaactat 


cgtcttgagt 


9900 


cactggtaac 


aggattagca 


9960 


gtggcctaac 


tacggctaca 


10020 


agttaccttc 


ggaaaaagag 


10080 


cggtggtttt 


tttgtttgca 


10140 


tcctttgatc 


ttttctacgg 


10200 


tttggtcatg 


catgatatat 


10260 


aataaaagca 


gacttgacct 


10320 


acgcttgagt 


taagccgcgc 


10380 


ttatttgccg 


actaccttgg 


10440 


atctgcgcgc 


gaggccaagc 


10500 


tatgacgggc 


tgatactggg 


10560 


cggcgcgatt 


ttgccggtta 


10620 


tcgctcatcg 


ccagcccagt 


10680 


ctcaaataga 


tcctgttcag 


10740 


ggcaacgcta 


tgttctcttg 


10800 


tggctcgaag 


atacctgcaa 


108 60 


cttagctgga 


taacgccacg 


10920 


gcggagaatc 


tcgctctctc 


10980 


tcgccgcgtt 


gtttcatcaa 


11040 


tggcttcagg 


ccgccatcca 


11100 


gagatggcgc 


tcgatgacgc 


11160 


cgcttccccc 


atgatgttta 


11220 


gctgctccat 


aacatcaaac 


11280 


aggcatagac 


tgtaccccaa 


11340 


ccatg 




11385 



